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business problem. Our authors describe the analysis, conversion, and testing process. 
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by Brand Former 
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In embedded-controller projects with execution time constraints, it’s okay to sacrifice 
memory for speed. The power-of-two Fast Memory Allocator (FJV1A) Thomas presents 
here is used in just such a project, 
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ACTIVE DATA OBJECTS & ASP 88 

by Mark Betz 

Active Server Pages are useful for generating output and managing application state on 
behalf of a client. When combined with Active Data Objects, your scripts can manipulate 
ODBC data sources to do nearly anything dial is possible in native SQL, 
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Speed is essential in all database projects, but not at theexpense of stability. You 
wouldn't try to go 100 miles per hour with your bicycle! The same is true in database 
technology, FairCom has been delivering fast, safe, full-featured database engines to 
the commercial marketplace for 19 years. Proven on large Unix servers and 
workstations, c-tree Pius's small footprint and exceptional performance has also 
made it the engine of choice for serious commercial developers on Windows and 
Mac. Check out www.faircom.com for detailed information. You'll be glad you did. 
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Half of your Client/Server project is the Server! You control 100% of your Client 
Side. Why settle for less on your Server side? Move your functions to the server- 
side to decrease network traffic and increase performanceI 


Today's database demands may often be too complex for traditional Relational Model 
Database Servers. Server needs come in many different sizes and shapes, What 
better way to accommodate these requirements than by allowing the developer to 
take full control of the Server side? FairConYs Server Development System was 
created to meet this need, ft provides the developer the means to create an 
industrial strength Server. Complete make-files are included for all FairCom 
commercial platforms. With our proven kernel add or override existing database 
functionality or create your own special multi-threaded server: 

Application Server Network Gateway Server Data Warehouse 

Special Web Server Departmental Database Server Embedded Servers 


FairCom Server Development System key features: 

Provides complete source code for ell the interface subsystems to the FairCom 
Server. Server mainline, Communication. Threading, Remote function interfaces and 
procedure calls are all supplied in complete C source code together with the 
FairCom Server sophisticated thread-safe kernel libraries. 
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X Finally, someone 
got the message. 

You’re not alone anymore! Stingray’s MFC Classes are here to help! 



S ome Limes when developing software you can feel lost at sea. 
You know Llie great features you want in your applications, but 
are stranded due to limitations in MFC Stingray Software can 
save you with a family of MFC extensions that add sophisticated 
GUI functionality in just minutes. 

Have you ever tried to add docking windows, like Microsoft 
Developer Studio? What about shortcut bars like Microsoft Outlook? 
Or multi-selection tree controls? Do you want VI “cool style ' 
toolbars with drag and drop customization without spending 
months coding? Objective Toolkit™ provides these and over 70 
extensions that add the coolest GUI functionality - in just a fraction 
of the time it would take you to build them from scratch. New 
Objective Toolkit PRO™ goes beyond Objective Toolkit Lo 
solve some of the more complex problems in the MFC architecture. 


With a family of over 
a dozen products. 

Stingray Software is fast 
becoming the one-stop 
shop for the object-oriented 
developer. 

To find out how Stingray Software 
can rescue you, surf Lo our web 
site for free demos, evaluation 
copies and white papers. 



All Stingray libraries come with PULL SOURCE CODE at NO 
ADDITIONAL CHARGE and have a 30 day money hack guarantee. 


If you need grids and charts to present different views of your 
data— but the schedule is just too tight, then Objective Grid™ 
and Objective Chart™ are for you. You save hundreds of hours 
by using the built-in wizards and now Objective Grid has complete 
Excel formula support. 

Do you want to provide graphical layouts in your applications? 
Don't spend months struggling with the GDI. use Objective 
Diagram ™. It comes complete with zoom and print support plus 
OLE automation to transfer your layouts to other programs. 

When you need to provide a full-featured color syntax editor, 
don't spend months writing code — turn to Objective Edit™ . 



Stingray Software, Inc, ■ 800-924-4223 * 919-461-0672 • email: sales@stingray.com 

www. sti ng ray. com 

All products and brand names are trademarks and/or registered trademarks of their respective holders. 
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EDITORIAL 


Fear and 
Loathing 
on the 
Y2K Trail 


C ontrary to what you’ll read elsewhere in this journal. the Year 2000 crisis has nothing to do 
with conversion, Cobol, or computers, No, the real problem is that the Year 2000 is an 
election year. And, even though the ballot boxes are more than two years from being stuffed, 
a rogue's gallery of crooks, con-men, and congressmen is already bellying up to the public 
trough. 

For its part, the federal government showed again it's on top of things by recently establisliing 
a Y2K task force to assist state and local governments, the private sector, and foreign 
governments. But no sooner had the government’s press release hit the wires than the 
Information Technology Association of America (ITAA) weighed in to take credit, stating in its 
February 4, 1998, press release that “we suggested that a commission be impaneled to increase 
attention to solutions...to Year 2000 challenges." ITAA president Harris Miller went on to say that 
“the true count down to the Year 2000 begins today.* 

Sony' Harris, but in the real world, the Year 2000 countdown began a long time ago. In fact, 
according to those in the Y2K trenches, any yet-to-start project that wall take more than 1.5 years 
to complete won’t be finished in time. Underscoring this point, consulting firms specializing in 
Y2K conversion aren’t taking on new- projects that start alter July 1998, And in some cases, 15, 

20, or even 50 years isn’t enough time to fix what soon will he broken. 

Take global positioning systems (GPS), for instance. Over the past few years, millions of GPS 
receivers (not to be confused with GPS ground stations or satellite broadcasters) have been sold 
to sportsmen, geologists, archeologists, the military, taxi cab drivers, and others who want to 
know where in the world they are at any particular time. As pari of the algorithm that lets the 
GPS receiver pinpoint location, GPS distributes time information* (A time differential is required 
to fine-tune rough triangulations when determining exact locations.) Since GPS time data is 
available throughout much of the world, other applications (such as financial computers) 
piggyback on GPS Lime/date information for a variety of purposes. Every millisecond, thousands 
of computers take time calibrations from GPS broadcasts for calculating interest an huge short¬ 
term electronic-funds transactions. 

But in the GPS signal standard, dales use 13 bits to represent a time-unit offset from a 
conventional epoch date consisting of two fields (epochsoffset). GPS time receivers that have 
been programmed to update the epoch field will experience only a minor hiccup (if any). On the 
other hand, receivers that have this information burned into programmable read-only memory 
(PROMs) will likely fail because, on or about August 22 r 1999, the date value will overflow this 
13-bit type as satellites broadcast a new epoch. These hardcoded epoch time subsystems will 
think lire calendar has been reset to the epoch in 1980. In short, any system that hardcodes the 
GPS epoch and is sensitive to the fact that “1980" is not “1999” will fail. Luckily for the vendors 
(but not users), most GPS time-receiver equipment is sold under a warranty that says “this 
equipment Is not warranted to he suitable for any particular purpose." 

This is only one type of problem faced by global positioning systems. There is, of course, the 
now-infamous literal value *W in date types. There are also problems with other types, 
including type overflow' problems at various dates throughout 1999; Y2K arithmetic that 
implicitly assumes dates later than December 31, 1999, are impossible; and implicit module- 
interface date-type conversions, to name a few. 

Compounding the Y2K GPS problem is that it is difficult, if noi darned near impossible, to 
construct valid GPS test cases to see what will happen at the Year 2000. Why? Because, say GPS 
experts, the future (time) states of the system depend on physical values (such as orbital 
elements and gravitational forces) that can only be accurately determined within about three 
months of the Year 2000, 

As outhouse luck would have it, the ensuing chaos will coincide with Year 2000 elections 
campaigns, in a more perfect world, errant GPS systems would deposit special-interest soft 
money into opponents’ coffers, while routing candidates to hostile speaking engagements. Ahs, 
the best we can hope for are candidates who advertise themselves as “Y2K compliant." 



Jonathan Erickson 

editor-in-chief 

jerickson@ddj.com 
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Objective Toolkit” 


Filling the holes in MFC 




The best MFC extensions money can buy 


I f you're a MFC programmer, you love \i for its object-oriented 
design — but haLe that most of today's coolest GUI features are not 
available. Features such as: docking windows like Microsoft 
Developer's Studio, '97 style cool-look dockable toolbars and menu 
bars with full customization, and shortcut bars like Microsoft Outlook. 

For years Stingray Software has been filling the holes in MFC with 
Objective Toolkit. It contains over 60 extensions that add these and 
other great features such as: multi-column multi-selection tree controls, 
enhanced tabs and custom controls such as color wells, thumbnails, 
editable list boxes, calculator, calendar, masked edits, time/date and 
currency fields. To code these with MFC would take months, but 
with Objective Toolkit you're just minutes away from these sought 
after features. 

Also chock out our Objective Toolkit Pro. a popular add-on to 
Objective Toolkit, w r ith the following advanced features: Layout 
Manager, dockable CViews, powerful scripting and forms engine, 
and a full model view controller framework — an alternative to 
document view. 



With over a dozen products. Stingray Software is the object-oriented 
programming expert. Visit our web site for demos, evaluation copies, 
white papers or more information on this or any Stingray product. 


All Stingray libraries come with FULL SOURCE 
CODE at NO ADDITIONAL CHARGE and have a 
30-day money back guarantee. 


Stingray Software, Inc. * 800-924-4223 • 919-461-0672 * email: sales@stingray.com 

WWW- st i n g ray. co m/otmfc 

All products and brand names are trademarks and/or registered trademarks of their respective holders. ^EZcS 
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Behold the Sun " Ultra'" 5 and Ultra 10. Together, they represent a major evolutionary leap 


over all lower forms of workstations. Because they’re the first full-fledged Sun workstations 


(with up to a 300MHz processor, no less) that'll let you run all your favorite PC apps. All while 


delivering the power, scalability, networkability, and proven robust UltraSPARC'VSolaris’" per- 
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formance you've come to expect from Sun. And all, believe it or not, starting at Just $3,895. 


(Roughly 13% less than a comparably equipped Compaq* system.) With these workstations. 



fixing ttdsfcl cT MSRP as at 2/B8. COFfi h nf Certain minlelt npl mHBbln Iw wee [MchflK. CIBSfl Sun MpcrnsvElBrni, Inc. M nghti rjgRriRil. Sun, Sun 
Ml SPtfli: trademarks sue ustO unfc Htuftnf and air tradmrurta nr registered IrWamnrks of 5W(C MCTWtlWHl, Inc In the Unled 51SK aid oUnf counlrieii. 
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EW SPECIES 


you can run heavy-duty applications one moment, then craft presentations using Microsoft® 



Office® the next. What's more, the Ultra 10 with its new Elite3D graphics will 


blow away a similar SGI machine, And at less than a third of the cost. And 


since both Sun models are binary compatible, they're perfect entry points 


to our full line of Sun systems (which, with up to 64 processors, can expand 


to meet anyone’s needs). Both ready to run over 12,000 applications available for Sun 


(And 


now every PC application beneath it.) All of which just may make them the missing links you’ve 



been looking for. For more information, call 800-SUN-FIND for a Sun reseller or represen¬ 
tative near you. Or visit our Web site at sun.com/ult/ntc. THE NETWORK IS THE COMPUTER. 1 " 



microsystems 
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Resizable Data Structures 

Dear DDJ 

In the January 1998 issue of DDj , John 
Boyer discusses resizable data structures 
in “Algorithm Alley." Under the heading 
“Java and the STL,” he discusses the Java 
hash table and the C++ STL Vector class. 

I would like to add that Java in the 
java.util package also has a Vector class 
available. This Java Vector class is a fully 
resizable data structure. Its elements can 
be any object, including other vectors. 

Using the constructor VectorfO );, Lhis 
data structure uses array doubling as a ca¬ 
pacity strategy, but any other strategy can 
be used by setting the iniUalCapacity and 
capacity hi c rement pa rameters. 

Using the irimToSize method of tliis class 
trims the capacity to die vector’s current 
size, thus allowing for die shrinkage of the 
data structure that John is looking for. 

More flexible shrinkage strategies, like 
shrinking the array to half size when it’s 
one-quarter full, as John proposes, can lie 
obtained by using Vectors trimToSize 
method followed by calling the etisure- 
Capacity method with an appropriate ar¬ 
gument. For the “one-quarter full” strate¬ 
gy, this argument would lx? two times the 
current vector size. This would look some¬ 
thing like Ex^i m pie 1, 

The availability of a fully resizable data 
structure (growing, shrinkage, different 
strategies) in Java puts the possibilities of 
this language in the right perspective. 
Alexander G. Vandervoort 
University of Groningen 
a.g.van.der.voort@bdk.mg.nl 

Dear DDJ, 

In his excellent “Algorithm Alley" (DDJ, 
January 1998), John Boyer pointed out 
that resizable arrays should always be 
grow n or shrunk by a factor of two. There 
are two additional comments which 1 
would like to make: 

• Tlie factor which is used does not have 
to be two. The only important point is 
that the array size should be multiplied 


by a certain factor rather diaii increased 
by a fixed amount. One can choose a 
larger factor than two to improve exe¬ 
cution times, or a smaller factor to re¬ 
duce memory overhead. However, if a 
factor smaller than two is used, care 
should be taken to ensure that the ar¬ 
ray grows by at least one element when 
resized. 

* Tins technique delivers amortized lin¬ 
ear time complexity. Tills means that the 
average time required to insert an ele¬ 
ment is constant. However, the maxi¬ 
mum time required still grows linearly 
with the number of elements. This may 
not be acceptable for some applications; 
however, in most cases, it should not 
lie a problem. 

Surprisingly, this powerful yet simple 
technique is very much underused. The 
number of programs that resize their ar¬ 
rays using fixed increments is staggering. 
As 1 found out recently, it even happens 
to the software giants. I wrote a Win32 
program that tries to create as many win¬ 
dows as possible. Under Windows NT 4,0, 
the elapsed time increases quadratically 
with tlie number of windows created; it 
is fairly safe to conclude dial this is caused 
by an array that is being grown by fixed 
amounts. (If anyone ai Microsoft is read¬ 
ing tills, please fix this in the next release.) 
Interestingly, tlie supposedly less power¬ 
ful Windows 95 does not have this prob¬ 
lem, 

Martin Boehme 

boehme® inform adk. mu-1 u e beck, de 

More Shunkworks 

Dear DDJ, 

It was interesting to see the letter from 
Quinn, Emanuel, Urquhart, & Oliver, IXP 
in die “Letters” section of DDJ, March 1998. 
To add to the lore, according to the Trade¬ 
mark & Copyright Journal, VoL 55 No. 1353 
(pp, 82), Lockheed Martin sued Network 
Solutions Inc, (the domain name registrar) 
for issuing domain names such as 
“sku nk works. com, n "sku nkworks, net," 
“skunkwiks.com," and “sktJrikwerks.com. ” 
In a nutshell, “Lockheed sued NSI for 
trademark infringement, unfair competi¬ 
tion, dilution, and contributory trademark 
infringement/ 1 Unfortunately for Lockheed, 
the "court granted NSEs motion for sum¬ 
mary judgement on all claims,” 

Jeff Clausius 

iome tries @io metri cs. com 


PTL Namespaces 

Dear DDJ 

I really liked Al Stevens’ column about FIT 
in DDJ, March 1998.1 felt compelled to sug¬ 
gest that a namespace name that no one 
else would dream of using might he “Al- 
StevensFTL” localise if Microsoft or AT&T 
can do it, so can he. Besides, I like the 
way it looks and 1 always enjoy APs arti¬ 
cles. In my opinion, he deserves it. 

John O. Viessdman, M.D. 

doctorv ©doctorv; com 

Checking on VerCheck 

Dear DDJ 

Although the VerCheck utility presented 
by John Graham-Gumming in DDj, March 
1998 is certainly very useful, I think a few, 
but important, comments are in order. 

1 also had to deal with this problem in 
the past and spent a few hours modifying 
Jeffrey Richter’s well-known Vershow util¬ 
ity, l could notice a few problems with the 
way some programmers (especially at Mi¬ 
crosoft) use VERSION INFO, When you 
open the Properties dialog box in the Ex¬ 
plorer by right clicking a filename, you get 
a Version tab that gives you the file ver¬ 
sion number. Like the VerCheck utility and 
unlike Vershow, this file version number 
is the “string" version number, not the 
“real" version number found in the 
VS.FIXEDITLEINEO data structure. 

So what? Well, I have noticed that it hap¬ 
pens more often than expected that the 
string version number doesn’t match the 
actual (numerical) version number, espe¬ 
cially for system files distributed by Micro¬ 
soft. it seems that tlie developers foe re of¬ 
ten forget to keep both entries in synch 
(this is done automatically in the the Vi¬ 
sual Studio Resource Editor but not nec¬ 
essarily in all other development environ¬ 
ments), So far as I know, installation 
utilities use the numerical version num¬ 
ber, not the string value. At least, I hope 
so. Therefore, I think that any utility giv¬ 
ing access to the file version number (in¬ 
cluding the Explorer itself), should also 
use the numerical file version number 
Otherwise, this may lead to problems like 
replacing a newer file with an older file, 
Using Lhe string version number is easier, 
but from far less reliable. 

By tlie way, another problem with ver¬ 
sion stamping at Microsoft: The vast ma¬ 
jority of the Win32 SDK include files don’t 
contain any version information, f thought 
that Visual SourceSafe macros where able 


if (myVector.size() >= myVectar*capacity() / 4) ( 

myVector. triinToSizeO : 

myVector.ensureCapacity£2 * myVector.size £)); 

] 


Example 1 ; Resizable data structures 


XO 


Dr: Dobh } sJournal, May 1998 











Making software work together™ 

Making software work together - it’s that simple. No matter how or where your systems are implemented, 
OrbixOTM will bring them together. Mainframe to Windows! Unix to Web - application integration has never 
been easier. Which allows yon to focus on what really matters - using the best technology in the right place. 
And OrbixOTM lets you evolve your systems over time, adding new ideas to the reliable solutions that your 
business is built on. 

But OrbixOTTVI does more than just integrate. It provides a complete enterprise solution. It features extensive 
and natural support for transactions, security, and systems management. There's advanced deployment 
support, load balancing and fault resilience. Which all means distributed object technology is now a long way 
from the lab. It's providing competitive advantage in the real world. 

But then chances are your competition already knows that 
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(continued from page 10} 
to maintain this automatically. This is sur¬ 
prising, Any professional developer will 
add version information in the include file 
headers he writes, even for small programs. 
I can’t imagine what good reason they have 
for not including version information in 
such important files. Since these include 
files may come with various products 
(Win32 SDK, VC++, Borland C++, and so 
on), it is often very difficult to determine 
which file is die latest and greatest when 
you install several of these products on 
your system. You absolutely can i rely on 
the time stamp. About two years ago 
(95/96). 1 received a new version of die 
Win32 SDK where all include files were 
dated 1993! Since then, I don’t even look 
at the time scamp when comparing include 
files from Microsoft, 

Patrick Philippot 

patrick. phi lippot@ in ame .com 

John responds: Thanks for your note, 
Patrick. My intent was to replace the long 
and arduous task of gathering the version 
numbers from Properties over the phone, 
1 agree with your analysis concerning the 
string versus numeric version numbers 
and VerCheck could lie improved to gath¬ 
er both quite easily. Also, I use the $ His¬ 
tory $ macro for automatic maintenance. 


This gives the full file history as well (as 
long as the developer bothers to write it). 

Source Code Comparison 

Dear DDJ, 

In reading DDJ, February 1998,1 noticed die 
“News & Views” item on MOSS, the pro¬ 
gram that provides a statistical comparison 
of source code, so that professors can de¬ 
tect cheating among programming students. 

While 1 agree dial this program could 
be useful, and that academic fraud should 
not be allowed, I know from personal ex¬ 
perience that sucli similarities should not 
automatically be construed as fraud. 

My fresliman year in college, I was tak¬ 
ing a Pascal course, and doing quite well. 
The day after submitting a project, I was 
summoned into an office with die Dean, 
my professor, and a classmate. The profes¬ 
sor placed our respective projects side by 
side, and asked us to explain ourselves. Our 
code was virtually identical, with only mi¬ 
nor differences in comments and variable 
names. We both admitted that the programs 
looked identical, and I Lhink die Dean was 
ready to expel us both Fortunately, we were 
both fairly meticulous notekeepers, and 
could fully document our development 
progress. Combined with an impromptu 
ccxling quiz in the Dean's office, we both 
managed to stay in school, although she 


and 1 were required never to be in the pro¬ 
gramming lab together or to .send e-mail to 
one another, and we lx>th got more than 
the average stares during tests. Eventually. 
1 think the professor w r as fully convinced 
that our identical code was just a fluke. 

For small projects, with programmers 
of similar training and experience, the like¬ 
lihood of running across very similar code 
is higher than most people seem to Lhink. 
Byron Jones 
by ron j ©fdpcorp. com 

Yet Another Profiler 

Dear DDJ, 

Embedded in die Watcana C/C++ 11,0 com¬ 
piler is a second profiler not mentioned by 
Ron van tier Wal in his article “Source-Code 
Profilers for Win32” ( DDJ March 1998). 
When you compile the source code with 
the w -ef switch on the command line, the 
compiler inserts timing code at the start 
and exit of all functions, and it creates a 
LOG file for each program and DLL after 
each run. I have found this option to be 
more convenient and giv ing more accurate 
results than the stand-alone profiler that 
comes with Wat com. 

Thiadmer 
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With Track Record,” the top-rated tool from the creators of BRIEF 

To kill bugs fast,, you have to stay an lop of thorn. Yu a need la know where they are, 
who's responsible for them and lots of other details. Keeping track of oil this infer mo 
tion can drive you crazy, Thai s where Track Record can help. Only Track Record offers: 

* A hacking system that keeps you on lop of oil your 
project details—like bugs, features and releases 

* Seamless integration with Visual SourceSafe, 

PVCS and MS Source Integrity 

* Built-in notified I ion via dynamic reports 
os well as optional email notification 

* A fully customizable database 

* Optional Web browser interface 

* Detailed histories of work that's been done 

* A complete client/server version 
For your important projects, join the more than 20,000 satisfied 
users who trust Track Record to help them gel the hugs out. You'll 
see why it's the proven market leader, rated #1 by more reviewers 
and users than any alher bug tracking system! 

Order Tattay! Just $195 with a DjjJJjuj 

Risk-Free, Money-Back Guarantee! 

Call (800) 343-7308 
or (617) 267-0743 
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IMAGE PROCESSING 


TRANSFORMS 

Including resize, resample (Interpo¬ 
lated resize), rotate {.01 degree), flip. 
Invert, reverse, crop, undelay, shear, 
transpose, fill auto deskew and com¬ 
bine bitmap (with mathematical and 
Boolean operations]. 


FILTERS 

Including sharpen, blur, brighten, 
darken. Invert, hue and saturation, 
Intensity, contrast, gamma; 
correction, histogram equalize, edge 
detect, line detect, emboss, mosaic, 
posterize, median and noise fitters, 
spatial filter [which can be pre¬ 
defined such as gradient, laplacion, 
sobel, prewItt, shirt and difference, 
line segment, or they can be 
customized}, and more. 

DRAWING 

Draw directly to a bitmap surface 
using any of the windows GDI 
functions [such as TexfOut Bit Bit, 
Ellipse, and Rectangle), 


REGION OF INTEREST 

Process only a specific portion of an 
image rather than the entire bitmap. 
Regtons can be comprised of any 
combination of rectangles, ellipses, 
rounded-rectangles, freehand 
shapes, polygons, transparent color 
and more, 
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ADDING IMAGING? 

not without 
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If you have an Imaging project In front of you, choose the #1 toolkit in the market. 
LEADTOOLS is an award winning imaging toolkit that puts more than 7 years of 
development and millions of lines of code at your finger tips. You will save countless 
hours and have access to the best imaging technology available, imaging 
technology that Microsoft, Hewlett Packard, Caret, Xerox and thousands of others have 
chosen for their products, With LEADTOOLS, your project is almost done! 


VIEWING b SPECIAL EFFECTS 

LEADTOOLS provides optimized rendering 
of any image to oil display devices with 
monitor calibrations, auto-dithering, 
scaling, zooming, scrolling, and animation, 
Choose from 113 Point Effects, 64 Dissolve 
Effects, and 24 Transition Effects, including 
pushes, pulls, wipes, splits, binds, crushes, 
rolls, circulars, diagonals, stretches and 
many more with delays, grain sizes, pattern 
bushes (up to 64 passes), o colored wood 
and a transparent color, The combinations 
are unlimited. Other Special Effects 
include: 3D Shapes, 3D Rotated Text, 3D 
Frames, Gradient and Pattern Filled shapes 
and much more. 

INTERNET/INTRANET 

LEADTOOLS features a Net Aware ActiveX 
and a Netscape plug-in for 
Internet/lntranet applications. Including a 
Bitmap Datapath allowing Images to be 
read from any URL Progressive JPEG, 
Progressive CMR and support for GIF 
interlace, transparency, animation, and 
embedded text, A FeedLood function has 
been created to allow image data to be 
displayed as it Is being transmitted across 
the net, 

DATABASE 

LEADTOOLS has specific features designed 
for the Imaging database developer: VB 
data binding, 32-btt ODBC, a customized 
OLE 2,0 In-place server. Load/Save 
memory, and Load/Save file offset. 

LEADTOOLS a cocked up Dy o SO-dav money tXK* 
guomtee (US & Canotto or*/) sod free technical &«pori is 
ouaiiofcte vto phone, tax, interne!. CompuServe or BBS. 


SCANNING 

LEADTOOLS supports high speed scanning 
using ISIS and TWAIN, TWAIN Includes both 
16 & 32 bit native and buffered RAM 
transfer modes, 

PRINTING 

LEADTOOLS performs all image processing 
necessary to print directly to any Windows 
supported printer, with the ability to print 
text and multiple imoges on the same 
page 

COMPRESSION 

LEADTOOLS offers more compression 
options than any other toolkit on the 
market, In both standard and proprietary 
formats. 

THE POWER OF LEADTOOLS 

LEADTOOLS is a collection of more than 
400 functions, properties and methods 
that provide low level functions for 
complete control, all the way to the 
highest level functions for ease of use. 
Imaging Gammon Dialog Boxes, (new to v 
9,0) greatly simplifies integration. Royalty 
free and available as 16 & 32 bit DLLs. 16 & 
32 bit ActiveXs, and a VBX and Includes 
extensive source code examples for 
Microsoft Visual C/C ++, Borland C/C++, 
Microsoft Visual Basic, Borland 
DelphLMicrosoft Visual FoxPro, Microsoft 
Access, VBond Java Script, 

NOW FEATURING ItflflTBtlS I Hi I MS 

EASYTOUSS ; g|itii]lxiii:i 

^DIALOGS 





“LF.ADTOOLS is mn indispensable 
tool, we used it jn die development of 
our Frontpage applica tion". 

Tom Button 
Director of Marketing, 
internet Ptaiform di Ttioist division 
Microsoft 


The tile format support la phenome¬ 
nal, LEADTOOLS gave Mterografx's 
applications (like CicnEaCiird™ and 
Piciun: Publisher™) the risibility U> 
import and e*pon a wide variety of 
■eslabliAhtui and new flic formats with 
ease. Their formal implemcnlalions 
arc complete and performance Great!" 

Andy Cohen 
Director Software Development 
Consumer Product* 
Mtcrografx 
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MS Visual Studio 97 Pro 
Version Upg 

by Microsoft Corporation 

*PhcB after $ 100 manufacturer's rebate. 


Programmer's Paradise 

the developer's definitive source for software! 


WISE Installation 
System Enterprise 
Edition 

by WISE Solutions 
The WISE installation System 
Enterprise Edition incorporates the 
award-winning WISE Installation 
System, and the tools required for 
any type of application deployment. The 
Enterprise Edition includes SmartPatch, 
WebDeploy, and SetupCapture. The WISE 
installation System creates professional 
installation programs for Windows, Windows 95. 
and Windows NT. WISE is a completely 
Windows-based installation editing/ 
testing environment. 



$645 


ProEssentials" 

by GigaSoft 
16- and 32 bit DLL, OCX, VCL, 
and VBX interfaces providing 
charting functionality with 
consistent visual quality, 
real-world practicality, end 
overall professional appeal. 

Suited for engineering, 
financial, data-acquisiuon, 

and information system development. Be sure to find the best 
tool for your important development needs. Download a 
demo [BOOK] or get a fully functional 
evaluation edition |3Meg] at Paradise No. 

wvwif pp a rad i se .com/p u b I is hers/g i g asoft qqq Q1 1 g_py 



$345 


Visual SiickEdit 

by MicroEdge 
This award-winning editor i 
development productivity, reduces 
costs of software maintenance and 
improves software quality through 
powerful features, software 
standardization and compatibility 
with your existing environment 
Software Standardization 

With its multi-platform presence, integration with industry 
leading development environments and 
compatibility with version control systems. Paradise No, 

Visual SiickEdit provides your entire M39 0122-FT 

organization with a standard O OOl: 

coding environment. 2>Z03 



Sax Basic 
Engine Pro* 

by Sax Software 
Give your apps the 
power of a VBA- 
compatible macro 
language. Drop it onto 
your form and get 
started right away. 

The Professional Edition gives your apps 
a macro editor and debugger loaded with 
features. The Enterprise Edition also 
includes a high-performance server macro 
engine with mufti-threading support. 



Paradise No. 
SOB 0310-FT 

$455 


VBTools 6 

by BeCubed Software 
Update your 16-bit applications 
today with any of the more than 60 
custom 16-bit VBX controls! You'll 
save bunches of time with the new 
International control because 
it assigns internationalized text to 
all loaded controls! And 
there's a Flow Charting 
control, a 2D Slider, a \ jVeFr 
Floating Text Extender ) j/ersion 
and more, plus many 
other enhancements! 
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Ultimate Grid Studio 
Professional Edition 

by Dundas Software 
Here it is I Now you can get 
the Ultimate collection of grid 
components from one reliable 
source, Each component fs 
designed to be the best in 
its class. Package includes 
Ultimate MFC, ActiveX, 

Java, and SDK grids. 

Components also 
available separately. 
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VEDITPLUS 5.1 

by Greenview Data, Inc 
Quickly edit absolutely any 
text, data or binary files up 
to 2 Gigabytes in ASCII, 

Hexadecimal or EBCDIC. 

The new VEDIT PLUS for 
Windows/DQS does it all: database; 
mainframe; CD-ROM; Postscript; C; HTML; 
programming; word processing; translating; 
and converting. Exceptionally fast and 
compact. Fully configurable and 
programmable. A favorite since 1980. 
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$135 


Spread v2.5 

by FarPoint Technologies 
A complete spreadsheet control 
for most environments that 
support a VBX. OLE control, or 
DLL. Use as a spreadsheet to 
obtain variable lines of data or 
display tables of information. 
Data-aware—connect to 
databases with the Access Engine 
and ODBC. Includes over 250 
properties and our improved 
Spread Designer. Formulas, sorting, 
and full-print support too. 



Paradise No. 
FD2 0110-FT 

$229 


Order online 24 hours a day, every day at mvw.pparadise.com 
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products! Use our product search button <§ order online 24 hours a day, everyday! 



RoboHELP Office 
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B13 0231 -FT 

$665 


The Fastest & Easiest Way to Create Help Systems 


by Blue Sky 1 '' Software 

ftohoHELP 5.5 

The Best Selling Help Authoring Tool* 

RoboHELP is the fastest and easiest way to create online Help for any 
platform. RoboHELP turns Microsoft Word into a full-featured authoring; 
tool capable of creating professional Windows Help, WebHelp, Microsoft 
HTML Help, Netscape Net Help, printed documentation, intranet/Internet 
Web sites, and Windows CE Help—alt from a single source. 

ftoboHELP Office 5 5 'Based on independent market survey. 

The Complete Help Authoring Solution 
RoboHELP Office provides a complete Help authoring solution with a 
collection uf powerful Help and HTML tools for authoring professional 
Windows Help, WebHelp, Microsoft HTML Help, Netscape NetHelp. 
printed documentation, intranet/lntemet Weh sites, and Windows CE 
Help—all from a single source. Includes RoboHELP. Robe HTML 
(at no extra cost!), plus over 16 powerful Help authoring utilities. 
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$475 


LEADTOOLS ActiveX 
Pro 32 

by LEAD Technologies, Inc 
LEADTOOLS offers technology in alt 
major imaging categories for color, 
grayscale and bitonal Imaging, with 
comprehensive functionality in each 
category. LEADTOOLS is an integrated 
development fool kit with more than 
500 functions, properties and 
methods. Common Dialog boxes make LEADTOOLS easier to use 
and add-on modules are available for additional FlashPix, OCR, 
and Video support. Paradise No. 

Lift 0132-FT 

$335 




RoboHTML 

by Blue Sky* Software 
The Ultimate Way to 
Author HTML Help 

RoboHTML is an HTML Help 
Authoring tool that provides a rich 
WYSIWYG editor with full drag and 
drop support, automated project 
management, and complete testing 
features. 'Unlike HTML editors, 

RoboHTML has built-in support for all 
the HTML Help features such as: Full drag and drop support for 
creating dynamic Table of Contents; multi-level 
Indexes; and Related Topics. Also supports Paradise No. 
easy creation of Popups, Navigation Buttons, B13 031D-FT 
Splash Screens, Shortcuts, Import existing 
Win Help projects and more. 


$469 


Doc-To-Help 

by WexTech Systems 
Create HTML Help, Windows 
Help for 95, NT 3.51/4.0 and 
3.0/3.1, HTML and printed 
documentation from one file, 
at the touch of a button, with 
Doc-To-Help 3. WexTech, the 
Microsoft acknowledged expert 
in HI ML Help authoring, 
incorporates our experience 
in Doc-To-Help 3, enabling you 
to create great HTML Help. 
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$399 


KEDIT for 
Windows 1.5 

by Mansfield 
Software Group 



includes both 32-bit 
Windows 95/Windows NT 
and 16-bit Windows 3.1 
modules. KEDIT is a powerful 
general purpose text editor with 
redefineable keys, undo/red a, selective line editing, regular 
expression support, enhanced syntax coloring, column oriented 
editing, file locking, a macro debugger, an ISM XEDIT- 
compatible command set, prefix area support, 
and morel The macro language is a subset Paradise No. 
of REXX. Also available in DOS and OS/2 man KE31 -FT 
text mode versions. 






Codewright Pro 

by Premia 

You'll work faster and easier 
when you use the right pro¬ 
grammer's editor, Codewright. 

Now you'll browse code faster 
with Outline Symbols. It 
gathers information about 
your code in the background. 

You'll juggle changes with ease, with 
Difference Editing. It lets you selectively 
combine the changes from two revisions 
The new Bookmarks Window lets you view book¬ 
marks by name and by file. Now with synchro¬ 
nising technology for Delphi and Visual C++ 

IDEs. With the help of the API Assistant, 
making complex function calls is as simple as 
filling in a form. 


Paradise No. 
P36 0111-FT 

$199 


List Pro 

by FarPoint Technologies 
A set of customizable List Box and 
Combo Box controls. Create header 
groups and sub header groups 
end/or multiple columns to create 
a very unique list control. Merge 
celts in rows with same text to 
make display more user friendly. 

Bind to supported databases using 
Microsoft Jet Database Engine or 
Microsoft Access ODBC. Virtual memory 
manager allows connection to any size database. 
Supports single or multi-select lists, embedded 
icons and bitmaps in columns and rows, plus 
drag-and-drop capabilities. 



Paradise No. 
FD2Q211-FT 
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Anyone can become an "innovator" 
using the software development tools 
youll find in Programmer's Paradise 
catalog! We feature the best products 
from today's top publishers—at the 
best prices, of course I 

Here's just a small sampling of what 
you'll find in the Programmer's Paradise 
catalog. To find all the tools you need 
to succeed—and innovate—call 
us today! 

A Personal Guarantee! 

If you're not happy, we're not happy] 
You have my personal guarantee that 
we' SI provide you with outstanding 
products, support and service," 
-Roger Paradis 
President & CEO 
Programmar's Paradise, Inc. 


ORDER NOW 

or call for your free catalog! 

1 - 800 - 445-7899 


Call for shipping charges/retum policy. 
Prices subject to change without notice, 
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or TAX 732-389-9227 


























































Developers: get the 
at a price you can’t 



^Includes a limited license for development and testing only: BackOffice license limited te ten (10) simultaneous comectlora. ** Offer good through Jm 30. Whale supplies last No shipping or handling costs. 
® IMS Microsoft Corporation. AH rights reserved. Microsoft. BackOffice. Visual Basle. Visual C++. Visual Studio. Windows, and Windows NT are either registered trademarks or trademarks ot Microsoft Corporation 
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Kudos for Free 
Software Pioneers 

The Electronic Frontier Foundation has 
awarded its 1998 Pioneer Awards to free 
software advocates Richard Stallman and 
Linus Torvald.c (“Free software 5 ' refers to 
the freedom* not the price, to redistribute 
the software, either with or without 
changes, either gratis or for a fee,) Stall- 
man founded the GNU project in 1984, 
which led to GNU, a UNIX-compatible 
operating system. He is also the princi¬ 
pal author of GNU Emacs, GNU C Com¬ 
piler, GNU Debugger GDB, and Other 
GNU programs. Torvalds, a recipient of 
the 1995 mi Dr. Dobb s Excellence in Pro¬ 
gramming Award/’ developed Linux, a 
free UNIX-like kernel, which is estimat¬ 
ed to have over five million users. 

— Jonathan Erickson 

PSCs: Personal Supercomputers 

Interested in owning a Cray supercom¬ 
puter? Tony Cole is your man. Cole's 
company, MemoryRilia (San Mateo, CA) 
sells various pieces of Cray supercom¬ 
puters for prices ranging from a few hun¬ 
dred dollars for memory boards encased 
in Incite to several thousand dollars for 
a Cray-1 chassis (for those in the market 
for a very expensive bench). Five years 
ago, Cole bought Lawrence Livermore 
Labs 5 Cray-J, which was about lo be 
junked, for $ 10 , 101.01 (it originally cost 
$19 million). Two years later. Cole called 
Cray Supercomputers to try and track 
down more equipment. After persisting 
for tw o months, Cole finally got through 
to Seymour Cray himself, and managed 
to buy a Cray-2, Cray-3, and Crav-4 for 
$5000. Cray may have got the better of 
die deal, since S5OG0 was more than they 
would have received from the junkyard. 
Cole has since added a Cray-YMP and 
Cray-XMP to his collection, and is plan¬ 
ning to expand his business to other 
computer relics such as core memory. 
More information is available at http;// 
www. memorybili a com /. 

—- Eugene Eric Kim 

Smart Dialing 

These days, cellular phones and other 
hand-held devices can perform every 
communications task imaginable: e-mail, 
voice-mail, paging, web browsing. But 
even with all this palm-sized power, en¬ 


tering information remains a serious prob¬ 
lem. Rather than miniaturize existing in¬ 
put devices or experiment with hand¬ 
writing or speech recognition, Tegic 
Communications decided to make exist¬ 
ing input methods smarter. Using a 12- 
button telephone keypad with three let¬ 
ters per key, Tegic s T9 software takes a 
sequence of keypresses, and compares 
the possible variations of character se¬ 
quences with a dictionary to determine 
tite word the user meant to type. The al¬ 
gorithm is parent-pending, and Tegic 
Communications $ currently licensing dic¬ 
tionaries in English, French, German, 
Spanish. Italian, and Korean. For more in¬ 
formation, see http://ww w.tegic.com/, 
—Eugene Erie Kim 

Let it Snow... 

If you didn't get enough of last w inter's 
cold and snow, check out http: //www 
.mageesci.com/Antarctic/, w here Na¬ 
tional Science Foundation # (NSF) re¬ 
searcher Tony Hansen is posting video 
clips of liis “vacation” to the South Pole. 
Hansen s primary goal is to collect in¬ 
formation regarding the degree of envi¬ 
ronmental pollution, specifically aerosol 
black carbon. 

— Jonathan Erickson 

Math for the Web 

The World Wide Web Consortium ( W3C) 
has announced the release of Mathemat¬ 
ical Markup language (MathML) as a W3C 
Proposed Recommendation. Math ML is a 
low-level syntax for representing struc¬ 
tured data (such as mathematics) and 
mathematical expressions over the Web. 
In other words, MathML is an XML-com¬ 
pliant markup language that describes the 
content and presentation of mathemati¬ 
cal expressions.. MathML provides tw*o 
sets of markup tags; one for presenting 
notation of mathematical data in markup 
format, and the other for relaying the se¬ 
mantic meaning of mathematical expres¬ 
sions. For more information, see- http:// 
www, w3. org/M a th/, 

— Jonathan EHckson 

The Taxman Changes 

According to a recent IRS pronouncement— 
Statement of Position (SOP) 97-2, Software 
Revenue Recognition— software compa¬ 


nies must further standardize financial re¬ 
porting. Tin's may resull in changes in 
the way some companies recognize rev¬ 
enue, For example, some companies may 
be required to defer reporting revenue 
longer, which could negatively affect 
short-term earnings. 

— Jonathan Erickson 

Advances in 

Nanoeiectromechanical 

Technology 

Researchers at the University of South¬ 
ern California's Laboratory for Molecular 
Robotics have used a uniquely pro¬ 
grammed atomic force microscope as a 
robot to push gold panicles 15 nanome¬ 
ters in size into precise locations on a 
pdy lysine-coated mica surface, spelling 
out the letters u U5C. n The gold particles 
are about 500 times smaller than a red 
blood cell and comparable in size to 
some molecules, Ari Requicha, profes¬ 
sor of computer science and electrical 
engineering/systems and leader of the 
USC team, says that “Control over the 
structure of matter at the atomic or 
molecular scale will undoubtedly trigger 
a major revolution in man-made arti¬ 
facts.” Applications for this type of tech¬ 
nology (which is called “nanoelec¬ 
tro mechanical systems/' or NEMS for 
short) include cell repair, ulirastrong ma¬ 
terials derived from molecular]y perfect 
prototypes, or compact disc machines 
with a thousand times more capacity" than 
current models. For more information, 
see hit p: f j Www - 1 m r. use . ed u/~ lmr/. 

—Jonathan Erickson 

Tel Goes it Alone 

John Ousterhout, the creator of Tcl/Tk, 
has left Sun Microsystems to found Scrip- 
tics, a company devoted to scripting tods, 
applications, and services, Seri plies’ flag¬ 
ship product will be TcIPro, a develop¬ 
ment environment for Tel, which will in¬ 
clude a source-level debugger and other 
tools. While no official release date has 
been set, Ousterhout expects the product 
to be ready by the fall. 

Ousterhout noted, a We're really excit¬ 
ed about this opportunity. Well get to do 
a lot of things for Tel that we've wanted 
to do for a long time/' More information 
is available at http: // www.scriptics,com/ 
— Eugene Eric Kim 
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Date Compression 
and Year 2000 
Challenges 


Storage overflow is the problem 


Robert L. Moore and D. Gregory Foley 

M any systems with Year 2000 (Y2K) problems store dates 
in “MMDDYY" form The first two places in this six-place 
field represent the month, the next two the day, and the 
final two Lhe year. There are many different ways to en¬ 
code MjMDDYY information. Typically, each place Is an integer 
type, or some other format (like characters), in the mechanics of 
the machine in question that represent the month, day, or year 
Six places can store a lot of information, but storage formats 
such as MMDDYY don't take advantage of this potential. For in¬ 
stance, the MM field only counts from 01 to 12, and the DD field 
only counts from 01 to 31 Neither field uses any of the remaining 
digits of storage potential. The MM field could (but does not) 
store the digits 13 through 99 (and 00), The DD field could (but 
does not) store the digits 32 through 99 (and 00). Potential stor¬ 
age space is lost because of the way information is recorded. 

Ordinarily, not using the potential storage space available in 
six places doesn't mailer. However, the field used to store the 
year is nearing the end of its storage capacity. The YY field will 
overflow when it goes from 1999 to 2000,1because it counts from 
00 to 99 (1900 to 1999 a.ix, for instance). This overflow, in ad¬ 
dition to other peculiar computer events that occur around Jan¬ 
uary' 1, 2000, (the use of '‘99" in the date field as a "save forev¬ 
er" indicator, the occurrence of a 400-year leap-year in 2000, 
the roll-over of the GPS time field from August 21 to August 22, 
1999, to mention just a few) is ai the heart of the the much- 
heralded Year 2000 problem. 

Y2K is not just a software problem, It also impacts hardware, 
firmware, and networks. Y2K affects mainframe, client/server, 


Boh and Greg are senior software engineers at Coastal Research 
and Technology Inc. Boh is the author of a Year2000 compliance 
criteria checklist which is widely disseminated throughout the US. 
intelligence community, Both Boh and Gtvg can lx contacted at 
rimmretMeroLs. com . 


desktop, and embedded systems. Neither is it just a Cobol prob¬ 
lem. Applications written in Fortran, PL/1, C, C++, Perl, Ada, and 
the like are at risk for Y2K problems. The solutions we present 
here are from the viewpoint of C on a client/server system, but 
every comment is generally applicable. Ultimately, fixing Y2K 
problems is principally about fixing storage overflow. 

Overflow eventually occurs in any storage format that records 
finite information. However, the MMDDYY date format over¬ 
flows much earlier titan necessary because of inefficient use of 
storage space in the MM and DD fields. 

Y2K Solutions 

A variety of solutions are available for fixing the overflow ver¬ 
sion of the Y2K problem. The two best known are expand¬ 
ing the MMDDYY date representation to MMPDYYYY (that 
is, using four digits for the year rather lhan two) or using a 
' logic 1 ' solution such as windowing (writing source code to 
interpret years into the correct century over a 100-year time 
span, for instance). Most Y2K repair discussions focus on 
these methods. (Another solution, applicable in limited cir¬ 
cumstances, is to use tlie 28-year cycle of the Gregorian cal- 
en da r' s a ssc >dat i on 1 let ween 11 ic da y -1 >(-11 te- week and I he d a I e 
in the years 1901 to 2099 inclusive. For example, July 4, 1998, 
Ls a Saturday as is July 4, 2026. We won't address that ap¬ 
proach in this article.) 

Date Expansion 

Given enough time and resources, expanding date representa¬ 
tions from MMDDYY to MMDDYYYY is usually the preferred 
method between expansion and windowing. Expansion offers 
.several advantages: 

* il lasts forever (well, from January f 0001, AJX to December 
31, 9999, a.d, anyway). 

* For most people, four-digit year format is the natural repre¬ 
sentation. 

Until recently, Y2K experts recommended four-digit expan¬ 
sion as the solution of choice. These recommendations have be¬ 
gun to change because only a short time remains to implement 
a Y2K fix. This brief remaining time span highlights the draw¬ 
backs to four-digit expansion: 
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* It must be implemented consistently everywhere dates are used 
or declared (source code, scripts, JGL, databases, computer-to- 
computer interfaces, user interfaces, and so on). 

• it uses more storage space than two-digit year-date repre¬ 
sentations. 

Changing from MMDDYY to MMDDYYYY is quite simple in 
abstract. In practice, you must find and appropriately alter all 
relevant declarations, output and input statements, calculations, 
and function calls. Furthermore, any variable that uses a changed 
variable (and any variable that uses tire variable using the changed 
variable, and so on) must be checked and perhaps changed. 

Windowing 

One frequently suggested alternative to date expansion is a log¬ 
ic solution such as windowing. In windowing, dates of the form 
MMDDYY translate on-the-fly into unambiguous (MMDDYYYY, 
for instance) dates. Windowing is a simple concept. Human be¬ 
ings easily implement it mentally when we automatically trans¬ 
late a date like 07/04/96 to 07/04/1998, or 02/29/00 to 02/29/2000. 

Windows come in two varieties—fixed and sliding. Fixed 
windows have a hard-coded comparison algorithm that prefix¬ 
es a U IT to all two-digit dates in a given range (all two-digit 
year dates, where 50 < YY < 99, for example) and a “20* to all 
remaining dates (00 < YY < 50), Sliding windows parameterize 
the comparison so that the range of dates moves as the current 
date changes; see Figure 1 and Listing One (listings begin on 
page 109). 

Regardless of the windowing mechanism chosen, windows 
are quicker and require fewer coding changes to implement Ire- 
cause they only apply wliere usage demands. Consider, for in¬ 
stance, the calculation AGE = CURRENT_DATE - BIRTH_DATE. 
For MMDDYY, this results in a negative (and hence erroneous) 
number if CURRENT_DATE and BIRTH.DATE fall on different 


sides of 01/01/2000. Windowing corrects this error. Windowing 
expands die date variable references to four digits at the point 
of calculation—AGE = WINDOW(CLIRRENT_DATE)-WIN- 
DOW r ( BIRTI l_ DATE). The declarations of AGE, CURRENT_DATE, 
BIR111_DATE need not change, nor does any other usage nec¬ 
essarily need to change just because tills usage needed a win¬ 
dow, Reduced Implementation effort is far and away the prin¬ 
cipal driver for windowing, although windows have the added 
advantage of avoiding the extra data storage space needed by 
date expansion. 

Although w indowing may be the solution of the hour, it is 
not without disadvantages. Windowing is a quick fix— it works 
for the moment. But don't count on it forever The three major 
concerns with windows are: 

* Their usable time span is no more than a century. 

* They create additional testing challenges. 

* They raise the risk that a future maintenance programmer may 

accidentally make an incorrect source-code change. 

Sliding windows somewhat mitigate tine first difficulty. They 
move forward as time advances, extending their life span (as op¬ 
posed to fixed window s w here the useful time span eventually 
expires). Still, windowing solutions are not appropriate in many 
places—- for instance, the AGE calculation, if CURRENT_DATE 
and BIRTI i_DATE are more than a century apart. Unfortunately, 
sliding windows suffer even more acutely from die .second and 
third difficulties dian fixed windows, because they are function¬ 
ally more complex. 

The two common solutions, expansion and windowing, ne¬ 
glect the fundamental cause of the Y2K problem — storage over¬ 
flow'. Computers can store much more information in six places 
than is allowed by the MMDDYY format. Storing more infor¬ 
mation in the same space is the essence of the Y2K solution 
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known as “compression.” Nothing is really being “compressed.” 
Compression simply uses a more concise date representation 
method. Compression is one of the more technically sophisti¬ 
cated Y2K solutions. Unfortunately, many programmers are not 
aware of it. 

Compression 

Compression shares many of the advantages and disadvan¬ 
tages of expansion and windowing. Among compression’s ad¬ 
vantages are- 

* It can last for a very long time (for example, from January 1 , 
1600, a.d. to 768,151,959,528 a.d.). 

* II does not require additional data Storage space tor expand¬ 
ed dates. 

Compression also has comparative advantages with respect 
to windowing and expansion. For instance, compression re¬ 
quires no more source code changes than does expansion, yet 
it can store much more information. Further, except for output 
to human-readable forms, compression requires no function 
calls—a significant advantage over windowing* This does not 
mean that compression is suitable for every situation. Com¬ 
pression works well when: 

* An application is built for the long term. 

* The project to reach Y2K compliance is not a death march. 

* Storage space concerns had some weight in design decisions. 

The compression formats given later provide a wide range of 
alternatives for date storage* We've included C source code for 
converting between MMDDYYYY and each compressed format 
and back again. These calculations rely on knowing leap years; 
see Listing Two. These are useful when conversion to a user- 
readable format is required, or some calculation or interface 
with MMDDYYYY format is needed. 

The compressed Formats described next are in order of in¬ 
creasing date-storage capacity* 

The CYYDDD Format 

Using 1900 as a starting date (you can choose any date), one 
compression method is to count the number of years elapsed 
since 1900 , and the number of days since the beginning of the 
year. The number of years is recorded in a three-digit field, 
where the first digit ( 0 ) represents the number of centuries 
elapsed since 1900 and the second and third digits (YY) are the 
year in the century. Places four through six (ODD) are the day 
count in the year beginning with January 1, For example, July 
4* 1999; is 099185- July 4, 2055, would be 155185. This storage 



Figure 1; Sliding windows. 


format can count 10 centuries, 99 years, and 365 (366 for a leap 
year) days; ii can store a total of 1100 years. Converting MMDD- 
YY to C YYDDD (see Examples 1 and 2) permits dates from Jan¬ 
uary I, 1900, to December 31, 2999* 

This format is useful when some user-readability is important 
but storage space requirements do not permit date expansion, 
and date range requirements make the single-century-at-a-time 
limits of windows unfeasible. Expansion to four-digit years is 
usually preferable in tills case, if practical. 

The DDDDDD Format 

Another approach is to count the number of days since a cer¬ 
tain date rather than the number of years, months, and days. 
For example, storing date information as DDDDDD—counting 
days from January 1, 1900— means the same six places that can 
only store dates from January 1, 1900, to December 31, 1999, in 
MMDDYY can store 1 million days in DDDDDD* This is equiv¬ 
alent to more than 2700 years. Converting MMDDYY to 
DDDDDD (see Examples 3 and 4) lets you track dates from Jan¬ 
uary 1 , 1900 , to past 4600 a.d. 

This format is useful when storage space does not permit ex¬ 
pansion and dating requirements do not allow windows* If pos¬ 
sible, expansion to four-digit years is still preferable* 

The MMDD 16-bit Year Format 

One easily implemented and natural way of storing more date 
information is to leave the month (MM) and day (DD) fields 
alone and replace the year field (YY) with a bit register. The 
two-place year field is equivalent to a 16-hit long register, as¬ 
suming that the computer in question puts eight bits in each 
character byte* Essentially, the format is the functional equiva¬ 
lent of MMDDYYYYY* (Note the extra Y.) Consequently, there 
is no need for a complicated algorithm—the year recorded is 
a YY expanded by windowing, just stored in binary rather than 
characters or digits. Since 16 bits stores 2 ICi -l = 65,535 years, 
dates from January 1, 0001, a.d. to December 3T 65,535, a.d. 
may lx: stored without ambiguity. 

This Format is generally more useful than four-digit date 
expansion, unless user-readability of machine-stored dates is 
a vita! requirement* This format has a far wider range and 
uses less space than four-digit expansion* and converts to 
and from user-readable date information easily. The format 
also requires fewer function calls than windowing, hence is 
more efficient at run time. It also avoids windowing's date- 
span difficulties while using no more storage space than a 
windowed date* 


MMDDYYYY to CYYDDD 

Represent YYYY as (Yl)(Y2)(Y3)(Y4), Then C = 
(Yin0+Y2)-19. The "YY" in the CYYDDD are Y3 and Y4. 
Use MMDDtoDDD,c (Listing Three: available 
electronically: see "Resource Center„" page 3} to 
complete the conversion. 


Example 1: MMDDYYYY to CYYDDD. 


CYYDDD to MMDDYYYY 

Assuming the CYYDDD count begins with i900 A.D., 

YYYY = (I9+CU100+YY. Use DDDtoMMDD.c (Listing Five: 
available electronically) to complete the conversion* 


Example 2: CYYDDD to MMDDYYYY: 
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MMDDYYYY to DDDDDD 

Use MMDDToDDD.c (Listing Three: available 
electronically) to convert the MMDD part of MMDDYYYY 
to DDD—the day count within a year. Then use 
DLLYYYYToDDBDDD.c (Listing Six; available 
electronically) to convert DDD and the YYYY part of 
MMDDYYYY to DDDDDD, 


Example 3: MMDDYYYY to DDDDDD, 


DDLDDD' to MMDDYYYY 

Use DBToDDD4Y*c [Listing Four; available 
electronically) to convert from DDDDDD to DDDYYYY* 
This is a four digit year and a count of days 
starting at January 1 within that year. Uee 
DDUTaMMDD.c (Listing Five: available electronically) 
on the DDD part of DDDYYYY to convert DDD to MKDB. 
The combined result is MMDDYYYY. 


Example 4: DDDDDD to MMDDYYYY. 

(continued from page 22) 

The 48-bit Date Format 

A format that allows storage of very long time spans is to count 
the number of days since a given date and store the informa¬ 
tion as a 48“ bit long register. If (he storage format of die com¬ 
puter being used equates eight bits to one byte, this 48-bit long 
register exactly replaces the MMDDYY date format with respect 
to storage space. This format is similar to the DDDDDD format. 
(Use die source code given with that format for conversions.) 
However, using bits rather than separate integers is a more ef¬ 
ficient storage method. It is so effective that counting from Jan¬ 
uary 1, 1600, a d. allows date storage to 768,151,959,528 ad. 
This is a sufficient time span for most applications. 

The all-bits representation is useful when storing long date 
ntnges. If desired, less than ihe full 46-bit representation may 
be used (say, 40- or 32-long bit register) to store date infor¬ 
mation with reduced storage space* For instance, using four 
eight-bit registers saves two bytes of space and is sufficient to 
store a count of 4,294,967,295 days, or about 11*75 million 
years. This format makes direct user interpretation of stored 
dates very difficult, Unless reduced storage space for dates is 
needed or lung span date storage is desirable, the MMDD 16- 
bit year format is preferable. 

Use the algorithms in Examples 3 and 4 (MMDDYYYY to 
DDDDDD and DDDDDD to MMDDYYYY ) for conversions. 
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Conclusion 

The Y2K literature suggests many other date storage formats. 
These formats arc often variations of those suggested in this ar¬ 
ticle (MMDDHH, counting years in hexadecimal rather than dec¬ 
imal digits or HDDCYY, counting months in hexadecimal and 
years as a century offset (Q plus years, and so on). The com¬ 
parative advantages and disadvantages of these formats are sim¬ 
ilar to the earlier formats. 

Date compression as a Y2K solution is not appropriate ev¬ 
erywhere, For instance 4 , windowing is die necessary choice of 
desperation for many, given the short time span in which we 
must implement a solution. Expansion is a straightforward choice, 
easily understood by anyone. However, when a project is not 
on a death march, when time spans longer than a century must 
be stored, or when using extra storage space is inconvenient, 
compression is an ideal Y2K solution. 

DDJ 

(listings begin on page 109.) 

Dr. Dobh s Journal May 7998 

















^^SiTRODUCTioN^CHE^ 


W j n H nws Platform Re leaseDate 


ReleaseDate 


UNIXPIatform 


Use MainWin To Port 
NT Code To UNIX, 
And You'll Only Have 
T o Rewrite One Line. 


Fast. Flawless. Flexible. When you use 
the MainWin cross-platform toolkit, 
you write apps for Windows once, 
then port the source code to UNIX, 
using UNIX-native WIN32 APIs. There's 
no need to double your programming 
efforts, no need to rewrite your code. 

MWMNSOFr.COM 

Plus, you can still retain the same Test 
and QA staff, and even use the same 
documentation for both versions, 
The comprehensive MainWin environ¬ 
ment ensures native performance to 
your existing UNIX customers, while 
speeding your own migration to the 
expanding world of Windows NT. 

MA^SOFT 

You will significantly reduce your 
development costs and simultane¬ 
ously launch new applications on 
Windows and UNIX. For an evaluation 
copy of MainWin XDE, call us, send 
email to democopy@mainsoftcom, 
or visit our website. A few seconcfe with 



lb might save you months of hard work. 

1-800-MAINWIN 


m INWIN xde 


AD LINK 91 

C 1 Mninsnlt Corpnrntiiir.. 1270 Onhncfid PftHcwne. Strife i ID. Si.iniivv.i 3 L*, C.A Trleptunre; 1 40Hi - ' 7 4-.?40ll, E'jx: {408) '4-;l4<M. Main Win is .i Rigistcced trademark „iud NLtiniirifr is a nvnJcmni'k of Malnsofr Corporation, 

All other products or services names are trademarks nr registered trademarks. of their respective owners. 





Strategies for Solving 
the Y2K Problem 


Analyzing, converting, 
and testing legacy code 

William Gothard and Les Rodner 

T he bulk of year 2000 work involves 
determining which applications must 
be fixed first, coordinating their re¬ 
pair with other internal and external 
systems, and testing repairs. This is a lot 
of work, and selecting the right conver¬ 
sion method can lead to significant time 
savings and affect the success or remedi¬ 
ation efforts. In this article, we ll discuss 
the analysis, conversion, and testing pro¬ 
cesses, providing examples in GoboL 

Dote Field Analysis 

Before the year 2000 conversion process 
can begin, all date fields must be locat¬ 
ed in the application. Regardless of what 
type of remediation tool is being used, 
the single most important step in the en¬ 
tire year 2000 date impact analysis and 
conversion process is Lhe creation of the 
character string or pattern list. This list 
contains all of lhe possible date field 
patterns in lhe application (MMTE, *- 
YY, for instance) and literally drives the 
entire impact analysis and conversion 
process. 


William is a senior development engineer 
at Intersolv specializing in year 2000 
methodologies and remediatkm tools. Les 
is project manager/tea m leader for Inter¬ 
solv Factory2000, They can he contacted 
at wiUiam_gothard@intersolv.com and 
ks_rodner®intersolve. com , respectively. 


Special care and consideration must be 
followed in the creation of lhe pattern list. 
If too many variations or combinations of 
date patterns are used, too many false pos¬ 
itives (items marked as date fields thai are 
not actually date fields) will crop up. If a 
pattern is inadvertently left out, the pos¬ 
sibility exists for false negatives (missing 
a date field). The ultimate goal is to min¬ 
imize false positives and eliminate false 
negatives. By testing sample code from 
the application, the pattern list can be fine- 
tuned to an acceptable tolerance of false 
positives/false negatives. 



Missed date fields are difficult to iden¬ 
tify during the initial search process. An 
essential step that aids in locating any 
missed dates is an analysis that occurs af¬ 
ter completing the initial pattern search, 
a process known as “propagation” or 
"synonym” analysis. This technique dis¬ 
covers fields that do not have typical date 
field names, but are involved in date field 
actions (for example, MOVE WS-DATE TO 
W5-HQLD1). Chances are that the pattern 


search process would report on the en¬ 
tire line of code as a Y2K- impacted state¬ 
ment due entirely to the discovery of WS- 
DATE, but would not be capable of 
reporting on the line of code if WS-HOLD1 
were then moved to another obscurely 
named field such as WS-HOLD2 (MOVE 
WS-HOLD1 TO WS-HOLD2, for example). 
Here we have two named date fields that 
would most likely be missed by the ini¬ 
tial pattern-list search process. The po¬ 
tential for this type of situation requires 
that further analysis be done to discover 
any propagated or synonym fields. “Indi- 
reels” analysis independently searches for 
any parent/child/redefmes/FD relation¬ 
ships m regard to the previously discov¬ 
ered date field, and adds any discovered 
date fields in the list of impacted items. 

Another consideration when doing im¬ 
pact analysis is narrowing down the size of 
die fields to lie searched In Cobol, for in¬ 
stance, there are time stamps with lengths 
of up to 26 characters that contain poten¬ 
tially impacted dates. On the other end of 
tile spectrum, fields with a length of one 
can usually be eliminated because they al¬ 
most certainly have nothing to do with dates. 

Date Conversion Methodologies 

While there are several methods for han¬ 
dling date conversions, well focus on 
two —expansion and windowing. Ex¬ 
pansion involves convening the physical 
field sizes to store the century value, thus 
achieving full four-digit compliance in 
both the application and data. This re¬ 
quires changes to both die file structures 
and all associated application programs. 

In Listing One (listings begin on page 
109), tlie two YY fields were identified for 
expansion in the BULCOM-REC. Based on 
the decision to follow a date-expansion 
metliodobgy, these fields must l^e expanded 
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(continued from page 26) 
to store the two- character century values. 
Listing Two is die revised code using ex¬ 
pansion. When these two date fields are 
expanded from two to four bytes, it in¬ 
creases the overall size of the BIILCGM- 
RECOltD from 100 to 104 bytes* thus cre¬ 
ating the potential impact on all applications 
downstream that use die same B1LLCOM- 
RECORD. Keep in mind dial it is not a safe 
practice to subtract from any FILLER field, 
since another application could be using 
the FILLER area for a specific purpose. 

With expansion of die Virtual Sequen¬ 
tial Access Method (VS AM) file, job Con¬ 
trol Language (JCL) record lengths, block¬ 
ing factors, and catalog parameters must 
be revised for all instances that use the 
expanded record. Expansion is a perma¬ 
nent solution. However, the level of effort 
and time required to analyze all Interrela¬ 
tionships across applications—including 
the reorganization and restructuring of 
files, copy books, and databases—makes 
expansion a less viable option as the year 
2000 approaches. 

Even if a system has been fully con¬ 
verted using expansion, exposure still ex¬ 
ists if the system is dependent on any ex¬ 
ternal interface files coming in from other 
systems or companies. If those files have 
not been converted using the same ex¬ 
pansion methodology that has been used 
in your applications, you must implement 
a bridging program that converts the in¬ 
coming date to a format die converted tar¬ 
get system can understand. Next, you must 
undergo the extremely risky task of de¬ 
ciding when to eliminate the bridging pro¬ 
grams based on external system or inter¬ 
face file revisions. 

The expansion scenario poses die great¬ 
est risk, especially this late in the game. 
Conversion takes die most time due to the 
interface analysis that must occur. It has 
die highest risk of failure due to errors of 
omission. Organizations widi large appli¬ 
cations that have not yet started the con¬ 
version process should not consider ex¬ 
pansion, except for cases where expansion 
is the only possible choice. 

Windowing 

Windowing involves interpreting the cen¬ 
tury date field by inserting fixed or slid¬ 
ing windows into die application program 
logic. This enables programs to process 
two-digit dates without converting the 
files. Windowing based on a “pivot date" 
year (Year=30, for instance) is a reason¬ 
able approach that will; 

* Achieve Y2K compliance in less time. 

* Require less effort. 

* Provide a targeted solution that will en¬ 
able applications to function while plans 

are made to migrate the process to an 


environment that enables storing centu¬ 
ry values. 

The windowing technique is fairly 
generic and can be adapted to any appli¬ 
cation by adjusting the pivot year date. 

I n Lisdng Three, a commitment must be 
made to retire the application's environ¬ 
ment by die year 2029 or else rewindow 
the application as the year 2030 ap¬ 
proaches. The re window effort may be 
more straightforward if the windowing 
code technique stores the pivot year lit¬ 
eral as a field name in Working Storage 
and common routines are used to execute 
the window logic. 

Suppose the value in POLICY-START- 
DATE is 290101 (January 1, 1929). In this 
case, Listing Three will conclude that the 
year is 2029. Now, suppose the value in 
POLICY-START-DATE is 300101 (January 
1, 1930). In this case, Listing Three will 
conclude that the year is 1930. 

Windowing's impact on data stored in 
files and databases varies depending on 
die application type (personnel, banking, 
insurance, telecommunications billing, and 
so on). All affected data stores must be 
thoroughly analyzed to identify those in¬ 
stances containing dates that will not fall 
within the 99 years that span the pivot 
year boundaries. 

Windowing analysis is only concerned 
with cases such as 

* Test date fields for greater than/less than 
conditions between fields and/or literals. 

* Test date fields for equal conditions 
against date literals (equal tests between 
date fields are ignored). 

* Subtracting a value from a year. 

* Actions involving reference modification. 

* Moving century literals. 

* Existing window logic. 

Windowing's primary advantage over 
expansion is that you don’t have to wor¬ 
ry about legacy data integrity other than 
finding the correct pivot date. Bui you still 
have to find every date-related manipula¬ 
tion function. For windowing to simplify 
matters when implementing a more per¬ 
manent solution, care must be taken now 
in how the analysis results are docu¬ 
mented. If done correctly, the window 
strategy will permit future rewindowing 
efforts to move the pivot year efficiently 
and in a timely manner. A rewindow ef¬ 
fort would only involve revising the piv¬ 
ot year value stored in Working Storage, 
plus any unique workarounds document¬ 
ed during the original window imple¬ 
mentation. 

An effort to retire/rewrite the system 
should include a complete reengineering 
strategy to include all of the major sys¬ 
tem development life cycle (SDLC) phases 
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(continued from page 28) 
of requirement analysis, functional design, 
detail specifications, and so on. During 
this reengineering effort, existing logic 
may be determined obsolete and formu¬ 
las may prove outdated, in addition to re¬ 
ports/screens that may require total re¬ 
vamping or elimination, The Year 2000 


Missed date fields 
are difficult to 
identify during the 
initial search 
process 


effort will provide an accurate inventory 
of all system components that will be use¬ 
ful as a starting point for reengineering 
activity. 

Determining Pivot Dote 

The pivot date year selected depends on 
the application or business requirements. 
It is usually determined by the oldest date 
that is processed and is also affected by 
any forward date' processing logic. 

To adapt the data to the pivot year cri¬ 
teria, unique approaches are required. 
These include logic that recognizes spe¬ 
cific instances that have out-of-bound 
dates and proceeds to execute unique log¬ 
ic for those instances. For example, 
records with the year of 100101 (January 
l, 1910) would incorrectly be windowed 
as January 1, 2010. In this scenario, the 
pivot year may be lowered to accommo¬ 
date the early date but would reduce the 
future life span of die window. This would 
create extra expense for the company to 
have the window pivot year changed 
again in the near fiiture. Two options 
available to work around this case are: 1) 
Identify these exception records to the 
program (social security number, account 
number). When an exception record is 
encountered by the program, special log¬ 
ic is executed that windows and process¬ 
es the affected dates based on a non¬ 
standard pivot year. 2) Create a file to store 
these exception records and process these 
records in a separate program. Tilts re¬ 
quires extracting these records from the 
file: creating an identically formatted file 
to store only these exception records; se¬ 
lecting a pivot year based on the '‘oldest” 
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• To enable the code review to easily 
identify the obsolete logic; 

• To provide a quick reference that aids 
in resolving problems detected during 
system testing; 

• To permit rebuilding of original logic in 
a timely manner should the Y2K logic 
prove invalid or unnecessary. 


Compound conditionals can be espe¬ 
cially challenging. The original tests should 
fie done before executing the window log¬ 
ic, Unconditional windowing of dates 
should lie avoided. This eliminates the risk 
of receiving abnormal terminations 
(ABENDS) due to the failure of consider¬ 
ing inclusive/exclusive AND and inclusive/ 
exclusive OR relationships. 

listing Seven illustrates this approach. 
The year 2000 logic must maintain die 
original logic’s assumption that it is only 
100 percent guaranteed that the date val¬ 
ues will conform to die date field PIC 
clause when field B-CLM-STU5-CD = O, 
When B-CLM-STUS-CD has any other val¬ 
ue tlian Q, then the test will not occur and 
the content in the date fields is unknown 
and not impacted per die original logic, 


Conclusion 

The year 2000 may very well lie the largest 
task an organization ever faces. Selecting 
die right techniques and tools can signif¬ 
icantly affect your time savings. With the 
windowing method of date conversion, 
you can achieve compliance in less time 
with a manageable level of effort, At this 
point, every hour counts. 


(continued from page 30) 
date (usually pivot year = “01” is a preferred 
value); cloning die program that has be¬ 
come Y2K compliant and changing the 
pivot year value; and processing the stan¬ 
dard file through the standard pivot year 
program; processing the exception file 
through die cloned program diat contains 
the nonstandard pivot year. 

These options are only suggestions. 
Unique cases will require the application's 
business and technical experts to devise 
strategies that solve cases that will not 
adapt to a 99-year window strategy, 

External sorts based on dates require 
redefining the sort date location and iden¬ 
tifying where the year is located. Sever¬ 
al sort utility vendors offer add-ons that 


intercept die sort; append the century per 
the application rules (the pivot year); sort 
the file; strip the century; and generate 
the sort output file in the original format 
and correctly sequenced based on Y2K 
criteria. 

Testing Date Fields 

Window routines must test the date field 
for all Os and all 9s. If this is the case, then 
the expanded field's century value is set 
to 00 or 99 per the input value (liypass¬ 
ing the pivot date test). Listing Four illus¬ 
trates tills. 

When a year is equal to 00, the logic 
must decide if the value 00 represents an 
initialization or the year 2000. If a date is 
defined in the program (YYMMDD, YY- 


DDD, and so on) and the original logic 
only deals with the year portion of the 
date field (dial is, die YY value), then die 
unused part of die date field must still be 
evaluated. If the unused portion of the 
date (MMDD, DDD) is numeric and not 
equal to all zeroes or all nines, then die 
year portion of die date field contains valid 
data (for this case, year = 00 must be ex¬ 
panded to 2000). If die unused portion of 
the date is all zeroes (MMDD^OOOO, 
DDD-000), then the year equal to 00 is 
not a valid year and the century must be 
set to 00 not to 20, 

Listing Five illustrates the original code 
that requires the technique explained 
above. The revised Y2K-compliant pro¬ 
cedure division logic {windowing has been 
executed elsewhere, and is not shown in 
this example) is in Listing Six. 

When dealing with situations similar 
to that in Listings Five and Six, it's best 
to comment out the original logic rather 
than delete it. The advantages of doing 
this are: 
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1 - 800 - 463-8998 

Email: sales@lbridge.com 

Phone:(905)513-7800 

Fax:(905)513-1330 


RKJe i~ 

ward 

Your Best Visual C++ Programming Companion 
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Pentium*]! 


Dells expertise in industry standard 
technology gives the Dell Workstation 
40D a price-performance edge over 
selected models from significant 
competitors. This chert reflects a 3D- 
graphics-onented benchmark modeling an 
environment similar to mechanical CAD. 

Sprue per composite score. Lower number indicates greater price value. For more fofoeplb inffainmatlon, refer towww.£pecbench.org/gpc/iipcyopc.cdlrs.suini[naiiv.hlml. ^Copyright 1998, Standard 
Perfrurmance Evaluation Carpnratian. information vgfe ronect as qf print date 


Viewport CDRS Benchmark ' 1 Test 

Workstations - Relative Price Performance^ 

Dell 

Workstation 400 266MHz w/ELSA Gloria-M Glint MX 

£121 

IBM* 

IntelltStation M Pro Model 26U INGR w/In tense 3D Pro 220(MT 

SI 78 

Compaq 

Professional Workstation 6000 Diamond Multimedia FireGL4000 

S242 

Sun Ultra 

Ultra 1 CreatorSD Modal 170E w/Creator3D 

5446 

SGI 

02 RSQDO/IBDMHz SC02 

$461 

Digital 

Personal Workstation 266t w/AccelGraphics AccelPro 25Q0TX 

$424 

SGI 

tktane 1x175MHz R100Q0 SI w/Tram 

$672 




















Get ready for takeoff. Because the Dell" Workstation 400 flies. From its high- 
bandwidth dual Independent bus architecture to its single or dual fast, 266,300 
or333MHz Intel Pentium'" II processors, the word is "go" Even in the midst of 
the demanding data and graphics applications, it always feels like there's power 
to spare. And there is, thanks to all the high-performance industry components 
inside. Like the 3D, financial modeling, CAD, authoring and animation video cards 
and even enhanced audio capabilities such as integrated 16-bit Sound Blaster 
Compatible Sound. What's more, the Dell Workstation 400 has been compatibility 
tested with leading software applications to help ensure that your critical 
applications are ready to roll. But the real rush comes from the fact that yours 
will be custom-built from the ground up. And it's backed by a $9.8 billion global 
company* that can support it (and your company! in every way. With system 
engineering, 7x24 technical hardware telephone and online support, consulting, 
global integration, leasing and asset management The Dell Workstation 400 
comes with everything you'd expect in a workstation. Everything, that is, except 
the high-flown price. 


ENTRY-LEVEL WORKSTATION j 

1 MID-LEVEL WORKSTATION 

DELL* WORKSTATION 400MT 

DELL WORKSTATION 400MT 

* 3QGMHz Pentium" II Processor 

•Dual 300MHz Pentium 11 Processors 

(Dual Processor Capable) 

* 128MB ECC EDO DIMM Memory 

* 128MB ECC EDO DIMM Memory 

• 4GB Uftra/Wide SCSI 3 Hard Drive 

* 4.3GB EIDE Hard Drive 

(7200 rpm) 

* Matrox Millennium II PCI with 8MB 

* APPIAN Jeronimo J2 Graphics Card 

Dual Ported WRAM Memory 

• Two 1000LS (15.9" v.i.s.) Monitors 

* 1200HS (17.9" v.i.s.) Monitor 

• 24X Max 5 Variable EIDE CD-ROM Drive 

* 24X Max 5 Variable EIDE CD-ROM Drive 

• Integrated 16-bit Sound Blaster 

■ Integrated 16-bit Sound Blaster 

Compatible Sound 

Compatible Sound 

• Integrated 3Com ID/100 EtherLink 

• Integrated 3Com® 10/160 EtherLink® 

Controller 

Controller 

• Factory Installed MS Windows NT 4.D 

* Factory Installed MS® Windows NT® 

with 1 Year Telephone Support 

4.0 with 1 Year Telephone Support 

• 3.5“ Floppy Drive 

• 3.5" Floppy Drive 

• 3 Year Limited Warranty 1 with 3 Years 

• 3 Year Limited Warranty 1 with 3 Years 
Next-Business-Day On-site 4 Service 

Next-Business-Day On-site 4 Service 

$3855 

$5854 

Order Code: 900038 

Order Code: 900013 

HIGH-END WORKSTATION 


DELL WORKSTATION 400MT 


333MHz Pentium II Processor 
(Dual Processor Capable) 

128MB ECC EDO DIMM Memory 
9GB Ultra/Wide SCSI-3 Hard Drive 
Advanced 2D/3D ELSA Gloria XL 
Graphics Card 

1600HS Trinitron 14 (19.8" vis.} Monitor 
24X Max 5 Variable EIDE CD-ROM Drive 
Integrated 16-bit Sound Blaster 
Compatible Sound 


* Integrated 3Com 10/100 EtherLink 
Controller 

* Factory Installed MS Windows NT 4.0 
with 1 Year Telephone Support 

• 3.5" Floppy Drive 

• 3 Year Limited Warranty* with 3 Years 
Next-Business-Day On-site 4 Service 

$6449 

Order Code: 900039 


TO ORDER TOLL-FREE 


800 - 433-3498 



www.dell.com 


Mon-Fri 7am-9pm CT 
Sat 10am-6pm CT 
Sun 12pm-5pm CT 
In Canadarcall 800-839-0148 
GSA Contract #GS-35F-4076D 

| Keycode #99264 | 


DOLL 

www.dell.com 


•Prices and specifications: valid in Ihe U.S. only and subject to change without notice. Ifor a complete copy of our limited warranties, please write Dell USA L.F. r One Dell Way, Round Rock, TX 7B6S2. Attn: Warranties. $Z4X 
Max/tZX Min. 4 0n-site service provided by thind-party providers and may not be available in certain remote areas. .3Com and EtherLink are registered trademarks of 3Com Corporation ‘Reported revenues for last four fiscal 
quartets. The Intel Inside logo and Pentium era registered trademarks and MMX is a trademark of Intel Corporation. MS and Windows NT am registered trademarks of Microsoft Cmp. Trinitron is a registered trademark of Sony 
Corporation, Dell and the Dell logo ere registered trademarks of the Dell Computer Corporation. ©199B Dell Computer Carp. All rights reserved. 















A Year 2000 
Tool Suite 


A date scanner and data 
ager written in Java 

Dev Bhottacharyyo 

C ontrary to popular belief, the Year 
2000 problem poses a threat not 
only to mainframe systems, but to 
desktop and client-server systems. 
Many of these problems derive from pre¬ 
built routines that are not Year 2000 com¬ 
pliant and are regularly used with the cur¬ 
rent offering of visual and RAD desktop 
development tools. Another problem 
unique to client-,server systems is making 
sure every segment of the application that 
manipulates data is compliant. 

To address these problems, program¬ 
mers need both tools that scan source 
code for date-related data structures and 
functions, and those that create test data 
from existing production data. In this ar¬ 
ticle, 111 present two such tools. The scan¬ 
ning tool examines an application's source 
code for date-related areas and offers a 
preliminary analysis of the target applica¬ 
tion, The data ager tool lets you manipu¬ 
late existing production data to create a 
set of test data. Both die date scanner and 
data ager (written in Java 1.1) are avail¬ 
able electronically; see 44 Resource Center,” 
page 3- 

Date Scanning 

The scanner identifies areas in the 
source code that match a dictionary of 
date-related words—words often used by 
programmers to portray dates. The scan¬ 
ner then generates an 1ITML page con¬ 
taining the source code excerpts with im¬ 
pacted areas hypertext linked to a 


Dev is a consultant with Visa Inter na¬ 
tionals Year2000project working on desk¬ 
top and midrange computers. He can be 
contacted at devrnit9 7@aol com. 


date-dictionary URL, Not all of die source 
code in the report are true Year 2000 trou¬ 
ble spots, but it is better to err on die side 
of caution. 

Developers often use real-world terms 
to describe date variables and functions, 
so die scanner looks for words like 44 date/ 
“age/ and “yy-mm.” Example 1 shows a 
list of common “real-world” words. 

However, as Example 2 illustrates, some 
developers use words that have no bear¬ 
ing on what they are supposed to repre¬ 
sent, Examples of such obscure code are 
common, and only the developer knows 



why he used such deviant names to per¬ 
form date arithmetic or store dates. To 
guarantee better analysis, the scanner 
should include programming language re¬ 
served words that are used to declare user 
defined types and dates—“type/ “struct/ 
“class," and so forth. 

To avoid identifying nondate trivia as 
Year 2000 trouble spots (such as “message" 
which contains “age/ and 44 update” which 
contains “date"), I include a dictionary of 
words to ignore. Example 3 contains a list 
of some nondate-related words that can be 
potentially misidentified. If the scanned 
word is in this “ignored words 11 dictionary, 
the scanner disregards the word. 

Figure 1 presents an abridged descrip¬ 
tion of the JavaScan class. The complete 


Java source code (available electronically) 
is well documented. Objects derived from 
standard Java classes in java.io (File, Fileln - 
putStream, StnngBujJer, and the like) and 
java.awt {TextArea, Button, and so on) are 
used in this class. This applet/application 
can run on older browsers supporting JDK 

1 0.x. To recompile die package under old¬ 
er compilers of the JDK 1.0.x era, you must 
use the statements that are commented as 
belonging to JDK 1.0.x. 

When the applet/application is loaded, 
users acknowledge a disclaimer by click¬ 
ing the Agree button. The text field at the 
bottom of die applet/application window 
now has die focus. You must type the valid 
path and name of die file you want to ex¬ 
amine, and click the Open button. The 
source-code file will be loaded and 
scanned, and a report is created in the file 
outpuUuml. You can click on the applet 
hot-spot to view this page, or you can view 
the page from your web browser. 

The scanning tool scans only a single 
source file. One possible enhancement to 
this application would be to scan multiple 
source files, generate statistics on total lines 
of code and date-related fields, and out¬ 
put the report to multiple HTML pages. 

Aging the Data 

There are a number of ways of generat¬ 
ing test data, one of which is to “age” cur¬ 
rent production data. By aging, I mean 
that all dates are increased or decreased 
by a specified number of days or years. 
Each date is increased or decreased by 
the same amount, maintaining the data’s 
referential integrity between dates. Figure 

2 lists the classes used by the prototype 
for the ager project. 

To use the ager, you dump your data 
containing dates into a text file. The ager 
tool allows you to specify where the date 
lies in the text file rows, and the format of 
the date. The tool also lets you specify the 
format in which the date will be rewritten 
(see Figure 3), after which you can reload 
the data back to where they came from. 

I designed this application differently 
from the date scanner. It is not downward 
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WASTING 

TUU DEBUGGING! 




Finds Bugs Automatically 

lnsure++ is a Run-Time Debugger like no other. 

It finds the widest variety of programming and memory errors in your programs, automatically! 

lnsure++ finds all your memory errors, plus uses exclusive Mutation Testing technology to pinpoint 
algorithmic anomalies and other hard-to-find bugs. 

And unlike other tools, lrrsure++ gives you the exact location of memory leaks and other errors as they arc 
found in your source code. You'll stop wasting time searching for bugs anil start finishing projects faster! 


FjM Htlgs Aulontatically with fnsure++ 

<l_ Memory Leaks 
/ Memory Corruption 
/ I/O Errors t t 

V. Logic Errors 
V. 3rd Party Errors 
VI and more! 


U.l . 1 -//, 

For a limited time, you can DOWNLOAD 
E from our website. 


Try against your current debugging toots. You'll be amazed at bow 

finds more bugs automatically, saving you hours-even-days of labor! 


l or UNIX and Windows NT/95 (integrates with Visual G + + ) 
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ParaSoft 

Corporation 


OOWHIOAO lnsure++ today at: www.parasoft.com/insure/eval.htm or call 1-888-305-0041 x 4 







The bugs in your product launch left you stranded on the dark side of the moon. 
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age 

mdy 

tomorrow 

ccyy 

min 

year 

current 

mmm-dd 

ymd 

Pate 

now 

yy-mm 

ddmmyy 

period 

mo$ 

dmy 

time 

interval 

leap 

tm 

qtr 


Example 1: Common “real-world* 
words. 


(a) 

struct dRecord 
{ 

iirt xx; 
int yy; 
int zz; 

}: 

(b) 

int dTemp« mTemp, yTemp; 

w 

Dim dTetnp As Variant 


Example 2: Words that are not 
suggestive of what they are supposed 
to represent. 


breadth 

Imp 

language 

message 

update 

minim 

validate 

know 

percentage 

page 

itemindex 

manage 

bitmap 

html 

flagerror 

femin 

idyes 

section 

image 

agent 

centimeter 


Example 3: Nondate- related words 
that can Ixt misidentified. 


(continued from page 36) 
compatible to JDK 1,0, and it uses the new 
features of AWT anti the language en¬ 
hancements of JDK LI. t implemented ii 
with Borland's JBuilder 1.0, using Bor¬ 
land’s extensions to the AWT classes, 
These extensions can easily be modified 
to suit other compilers* 

You enter the name of the text data file 
(complete path information) and click the 
Open button. Twenty records from the 
data Pile Cor fewer if the data file is small¬ 
er) are loaded as a sample in the 
TextArea component. You then enter the 
record length to correctly align the 20 
records. 

You then select the year component 
and enter the starting point and length 
in the first row r of data in the TextArea 
to mark the year portion of the date. The 
portions you define are highlighted in 
the TextArea t reconfirming your entries. 
A button is provided to place the entry 
in a list for later use. likewise, you must 
mark the date, month, and the full date 
field in the same row. 

You then enter the format into the pre¬ 
set date field* The formal could be any 
combination of M, d r y, and separators. 
Notice the tL M” is in uppercase. After all 
the selections are complete, users click 


Class JavaScan 
Interface: 

Constructor 

Destructor 

GetAppletlnformation 

Action 

// Applet Related 

Init 

Start 

Stop 

Run 

End Class; 


figure 1: Abridged description of the 
JavaScan class . 


Class Date Routines 
Interface: 
Constructor 
Destructor 
setDate 
setMonth 
selYear 
setField 
parse 
End Class 

Class Aging 
Interface: 
Constructor 
Destructor 
getParameter 
//Applet Related 
Action 
Init 
Start 
Stop 
Run 
End Class; 


figure 2: Class representations used 
by the prototype for the ager : 



Figure 3' Running the Data Ager tool 

the Age button, and die aged data is saved 
as “c:\ouUxt". A sample 20 rows are dis¬ 
played back in Lhe TextArea for users to 
confirm whether the results are what they 
wanted. If the results are erroneous, 
changing the parameters and rerunning 
again will achieve the desired output. 

Tliis aging tool can alter only one date 
field at a time. It also can address only 
one table at a time in a database. Tie t(x>l 
is incapable of addressing date-derived 
fields like invoice numbers or purchase 
order numbers that may have portions of 
dates embedded in them. 


DDJ 
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SOO 876 4900 or 1 650 528 3450 Int'l 

www.elements.Gom 


“LIVE BY 
YOUR OWN 
RULES.” 


Make your Web site as dynamic as 
your business and as smart as your 
customers with ELEMENTS ADVISOR," 

Business rules define your company's 
policies, practices and ability to compete. 
Today's dynamic business environment 
means you often need to jump in and 
alter the rules that control your Web 
applications, A rule you can change is the 
one that says you need to embed your 
business rules inside application code. 
Because it separates business rules from 
application logic, ELEMENTS ADVISOR lets 
you put new business rules on-line in 
minutes. And it integrates easily with both 
new and existing code. So make a splash. 
Design a self-service application with a 
new level of interaction. For details visit 
us on the Web today. See a demo and read 
“Business Rules Automation," our new white 
paper on designing "smart’' application 
components at: www.elements.com 


E MAKE THE TddLS." 


ELEMENTS ADVISOR, ELEMENTS. Itie ELEMENTS logo and VOU MAKE THE RULES. WE MAKE THE TOOLS, are 
tiaderTBjks of Neuron Data, Inc. All alter trademarks are the properly of tteir registered owners. ©1998 Neuron Dala. Inc. 027DD 
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ot Processor Information 
-irsr cal1 native method to 
jxectile CPU s CPUIO instruction 
etCPUlD_DalaO ; 

;es$orFamtlyJQ - ProcessorFamiJyO 
ndtnsi ruction Avai Fable = 

Jpuid I nstrodlonSupported (), 
sessorisGenutoeintel = GenuineintefO; 

at processor name 
Ipuid I nstruction Ava liable) 


f Verity its a recent Intel* GPU 
(Pi oce&sortsGenuine Intel] 

// 4 is lowest family number 
U we could see here 
All processor signature 
ti into is available 
switch (ProcessorPamtlylD) { 
case O' 
case 1: 
case 
case 3; 

Programmer s paranoia 
We can t get here' 
p SetProcNamst “1 ; 
break; 


mmmmtnmumumtm 

// 

ff Procedure: scale array elements 

ft 

ff Function: 

U Scale the elements of a floating 
tf point army according to the 
desired scate factor 

il 

MMmtwmmmmimm 

void sea ie_array .elements (float sea te_f actor) 

t 


ft Get raw cache descriptors from 
it native CPU ID assembler routine 
p.GetCPU1D_CacheData (); 

Descriptors Defined = 

GetRawCache Descrip tors (Gpuidlnf* 

fi Reset descriptor output counter 

1 = 0; 

// Parse the descriptors if available 
it (DescriptorsDefined) 


lor (sat = 0: set < 2: set-H-} 


We know you don’t 
write slow code 
so buy it for the genius 
in the next cube. 


case 4 

H Handle the different types 
of imeR86ftm) processors 
separately 
p SettntetFamily4_PfocName |) 

break: 


float scale reap: 


lor (dward - 0, dword * 4 dword 

for (abyte = 0; abyle <- 3; a by 

{ 

Descriptor^] = 

Cpudlnfo[setl[dword3 
& Oxff; 

Cp utdi nf o[set] [dvrard] 

»= 0; 

if (Deserlptor[i) 1= 0) i++; 

II Early out on short butter 
if (I Size) 
return Descriptor; 


return Descriptor 


NOTE: VTune showed this method vs compiled 
// at execution time by the JfT compiler. 


ft Return null on error exit, 
return null; 


iimmmtmmmmmummm 

ft 

ft Procedure: GetProcessorCacheDescnptors 

n 

v Function; 

Use GPU ID instruction to retrieve 
raw data on the processor's caches, 
then extract the data into a simple 
// array. 

ft 

miwmttmmtmmmm 

into GefProcessorCacheDescnptors 
{mil] Descriptor Int Size) 

t 

in) Cputdlnfo[!Il = 

_ liP,Q,o,QUo,o. 0,0] j: _ 


mmmnmmmmmmim 

a 

U Procedure: FpuType 

it 

it Function: 

// Determine type of floating point 
// unit, 

U 

// returns. 

tf FPU .NONE if no floating point unit 
// FPU_aS7 if 207 floating point unit 

it FPU_387 if 387 floating point unit 

it FPU 487SX only if unambiguously * 
Intel 487SX NOP 

// FPU_4B6DX_DX2_4875X il one olln 
Intel486 DX2 f or 

it Intel486 SX processors 

// with 487 SX 

// FPU_ON_CHlP : if Genuine Intel prt>c€ 
if that supports CPU I D 

it and that reports FPU 

// feature flag 

// -1 if this routine cannot 

ft recognize the FPU 

mmmmmminmmmm 


lessor on which this program 
running 

jild P rocessorNameO 


ProcessorFa mily 10=0 
lean CpuidlnstroetionAvailable; 
lean ProcessorlsGenuinelntel. 


luidDlg p = new CCpuidDlg(m_pMamWnd); 


case 5 

Pentium * processor 
p SetProcNaroe (PrccessorNmne 
fPNAM .Pentium . Processor]) . 


if Check for MMX(lm) 
t technology support 
It f ProeessorHas Technology 
f MMX _ TECHNOLOGY)) 
p .Append!ProcName 
(MMX_Technotogy_ Sir |; 


Add to trademark symbols 
p.AppendPracNams { 
Trademark ^Strf; 
p AppendProcNa me ( 

Registered.Tracfemark _ Str); 
break; 


case 6: 

// Pentium* Pro or 
it Pentium' II processor 
if (Pn>cessorHasTechnology 
(MMX .TECHNOLOGY)) 
i 

p.SetProcName t 
ProcessofName [ 

PN AM_Pentium_K_Processcr I); 


scafe_recip= i f - scale factor, 
for (I = 0; L< size; i++) 

f_ array [i] = f array])] * scale_rectp. 


w VTune suggested multiplying by the reciprocal 
ft instead of dividing to the loop - multiply 
// is faster 
t* Original Code 
// for (i — 0; i < size, h-U 

t array[ij = t array[Ff / scale factor; 

I 


1 

else 

// Processor from the Stone Age, 

/ f CPUID inst- not available 
f 

p Se I ProcName (ProcessorName 
(PNAM NO CPUID]); 

i 


// Check parameters 
if (Descriptor — null II Size < 1) 
return null; 


ft Initialize result array, 
for (1 = 0; U Size,)++) 
Descriptor^] = 0; 















VTune 2,5 

Performance Tuning 
Environment 


Nobody likes to admit it but, the 
fact is, performance hotspots happen. 

And tracing and analyzing them 
can be tedious. Now Intel’s new 
VTune 2.5 gives you both an inside 
and system-wide view of how your 
software executes on Intel micro¬ 
processors through a simple, highly 
graphical interface. Whether you’re 
writing in C/C++, FORTRAN or JAVA* 
it shows you the problem areas. 
VTune even offers tuning advice for 
improving performance. So there’s 
no more guesswork. Sec for yourself 
on better yet, show it to the genius 
next door. Download a free VTune 
evaluation and get the complete 
technical overview at our Web site. 

► developer i ntel .com/ad/ Vt25ad4 


Intel. 

The Computer Inside.™ 





HDF: 

The Hierarchical 
Data Format 


An all-purpose 
file format for 
scientific data 

Brand Fortner 

A lthough there Ls no widespread stan¬ 
dard for sharing complex technical 
and scientific data, Lhe Hierarchical 
Data Format (HDF), developed at 
the National Center for Supercomputing 
Applications (NCSA), is a leading candi¬ 
date for such a standard. NASA’s Mission 
to Planet Earth Project, an $8 billion effort 
to monitor the earth’s environment, for ex¬ 
ample, uses HDF as its baseline data stan¬ 
dard for its million-gigabyte data arcliive. 
The HDF software is available in source 
and precompiled form for Windows 
95/NT, Macintosh, and UNIX, and can l)e 
downloaded at no charge from http:// 
hdf ncsa. u iuc. edit/. 

Hie HDF subroutine library is designed 
to lie easy to use. To write a C character 


Brand is the chairman of Fortner Software 
LLC and coauthor of Number by Colors 
(with Ted Meyer) and The Data Hand¬ 
book, both from Springer-Vedag. He can 
he contacted at brand©fanner.com, 


array that represents an eight-bit color im¬ 
age Lo an HDF file, for example, the only 
HDF call that’s required is ret=DFR8add- 
imagef image 1, rotes, cols. Oh , 

which creates and initializes the file my- 
flle.hdf, adds an HDF directory entry to 



the file, and stores die image as an HDF 
object. Although written in C, HDF in¬ 
cludes a set of wrapper functions that 
make it accessible from Fortran. 

The NCSA HDF Library 

The HDF library has three layers low- 
level interface, single-file applications, and 
multifile applications. Most HDF devel¬ 
opers work w ith the last two layers, which 
hide the low-level details. 


Each layer contains a number of in¬ 
terfaces, each of which contains func¬ 
tions that encapsulate a particular oper¬ 
ation or data structure. All functions in a 
single interface begin with Lhe same let¬ 
ters, Table I lists Lhe HDF interfaces, 
grouped by layer. 

Two HDF interlaces support multi¬ 
dimensional arrays, called “scientific 
datasets” (SDS) in the HDF documenta¬ 
tion. HDF 3 3’s SDS interface, which has 
largely replaced the older DFSD interface, 
supports simultaneous access to more than 
one File at a time* 

The SDS interface supports multi¬ 
dimensional arrays containing 8-, 16 -, or 
32-bit signed or unsigned integers, or 32- 
or 64-bit floating-point numbers* This in¬ 
terface lets you specify die ccxirdinate sys¬ 
tem used by the data, and die scale and 
units associated with each axis. You can 
specify how values should be formatted 
for display, a label for each variable, and 
store the maximum/minimum values for 
reference* 

The Vdata model (VS, VSQ, and VF in¬ 
terfaces) makes it easy to store tables of 
data in HDF files. Each table consists of 
a series of records, each of which con¬ 
tains a series of fields. Each field can sup¬ 
port its own number type. Valid field types 
include 8-* 16-, 32-bit signal and unsigned 
integers, 32- and 64-bit floating-point 
numbers, and ASCII characters* 

Vdata tables use three kinds of identi¬ 
fying information: 
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Make Sense of Complex Data 




W////////M 


The Graph Layout Toolkit’s powerful component technology gives 
you the tools to visualize complex relational information. With 
sophisticated layout technology, the positions of objects and their 
connections in diagrams, networks and schemata are automatically 
calculated in seconds. The Graph Layout Toolkit’s unparalleled 
techniques are a result of Tom Sawyer Software’s pioneering 
development of graph layout components. 


Hundreds of tailorable features are encompassed in the Graph 
Layout Toolkit’s four layout libraries: Circular, Hierarchical, 
Orthogonal and Symmetric. These libraries provide the visual 
enhancement required to make your application more effective. 


Benefits of the Graph Layout Toolkit 


Automates the positioning of complex data eliminating manual 
layout. 

Scales automatically to support an unlimited number of nodes 
allowing complete access to complex diagrams. 

Graph editing operations such as cut, copy, and paste provide 
smooth manipulation of graph components. 

Automatic edge label placement allows immediate interpretation 
of the drawings. 

Ability to expand and collapse sub-drawings provides easy 
navigation. 


before 


after 




download a demo at 
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lorn sawiier 


www.tomsawyer.com 
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Tel: +1 510 848 0853 E-mail: info@tomsawyer.com 




































(continued from page 42) 

■ A name describes the origin and con¬ 
tents of a table. 

* A class identifies the meaning of data. 

* Field names identify the individual fields 
that make up a record. 

Example 1 is a Vdata table with three 
fields and four records, 

The Vdata VS interface provides rou¬ 
tines for reading/writing tables, the VSQ 


interface routines for querying Vdatas, 
and the VF interface routines for manip¬ 
ulating individual Vdata fields. 

The Vgroup interface provides routines 
for reading/writing groupings of HDF data 
objects in a particular HDF file. Each 
Vgroup may contain any number of oth¬ 
er HDF data objects—even other 
Vgroups. In addition to its members, a 
Vgroup may also be given a Vgroup name 
and Vgroup class. Every function of 


Vgroup begins w ith the prefix V. The VTI 
interface contains routines for simple cre¬ 
ation of Vgroups and Vdatas. 

HDF Examples 

Listing One (listings !>egin on page 109) 
takes an eight-bit raster image (stored as 
a two-dimensional array of eight-bit val¬ 
ues), creates a linear grayscale palette, and 
stores this to an HDF file. 

Scientific data is often stored in multi¬ 
dimensional arrays. For example, a biolo¬ 
gist might measure the plant density in 
each square of a large grid, or an as¬ 
tronomer might have a computer simula¬ 
tion that provides the density at each point 
in a large gas cloud. 

Listing Two illustrates how to store a 
three-dimensional array of data (includ¬ 
ing some related comments) in an HDF 
file using die SD interface. Listing Three 
retrieves that same data. Listing Three 
doesn’t need to know in advance tire size 
or dimensionality of the data—it can read 
all of that information From the file. 

The HDF Disk Format 

An HDF data file consists of a magic num¬ 
ber (the control characters A N A C A SM} t a 
directory that points to data elements, and 
the data elements themselves. 

The HDF directory entries—known as 
“Data Descriptors” (DD)—arc organized 
into a series of DD blocks. The HDF di¬ 
rectory Cor DD list) consists of a series of 
these blocks. Each DD block contains two 
bytes indicating the number of DDs in that 
block, a four-byte integer with die loca¬ 
tion of the next DD block Cor 0 if tins is 
the last one), and then the actual directo¬ 
ry entries. Figure I shows the l>eginning 
of a simple HDF file with one DD block 
containing two DDs, 

r fhe location of the next DD block en¬ 
try (like all locations within an HDF file) 
is given in bytes from the beginning of 
the HDF file. Since all locations are given 


Location 

Value 

Comment 

0 to 3 

A N A C A S A A 

Unique HDF 

Number ( A N = 
control-N, etc) 

4 to 5 

2 

Number of DDs 
in DD block 

6 to 9 

0000 

Location of next 

DD block 
(none here) 

10 to 21 

— 

Data Descriptor#! 

22 to 33 

*— 

Data Descriptor #2 


Figure 1: Beginning of a simple 
HDF file. 


123456789 10 It 12 
-T ag-l-Ref-l—Offset—I-—-Length-— -1 


Figure 2: HDF Data Desc riptor 
layout , 


Interface 

Description 

Example Routines 

(a) 

H 

low-level I/O, directory, query 

Hopen, Hread, Hwrite, Hcreate 

HDF 

new version of low-level routines 

HDFopen, HDFclose 

HE 

low-level error reporting 

H Ere port, HEprint 

W 

DFR8 

read, write 8-bit raster images 

DFRSaddimage, DFRSgetdims 

DFP 

read, write color palettes 

DFPaddpal, DFPgetpal 

DF24 

read, write 24-bit raster images 

DF24addimage, DF24se1dtms 

DFSD 

single file scientific dataset 

DFSDputdata, DFSDsetdimscale 

DFAN 

text annotation records 

DFANputlabel, DFAMgetdesc 

<e) 

SD 

multifile scientific dataset 

SDstart, SDcreate, SDdiminfo 

NC 

netCDF interface 

nccreate, ncopen, ncvardef 

VS 

Vdata interface 

VSattach, VSfdefine, VSgetid 

VSQ 

Vdata query 

VSQuerycount, VSGueryname 

VF 

Vdata fields inquiry 

VFfieldsize^ VFfieldname 

V 

Access, Specify, Inquire Vgroups 

Vattach, Vstart, Vsetname, Vgetid 

VH 

Simple Vdata, Vgroup creation 

VHmakegroup, VHsto redata 


Table 1: HDF interfaces by layer (a) low-level layer interfaces; (h) single-file 
application layer interfaces; (c) multifile application lay>er interfaces. 


Tag Name 

Tag# 

Comments 

(a) 

DFTAG RLE 

011 

Specifies the run length encoding used for image. 

DFTAG„TID 

102 

Tag identifier: text string of user defined tag. 

DFTAG DIL 

104 

Data identifier label: used for titles of elements. 

DFTAG DIA 

105 

Data ID annotation: lengthy text annotation block. 

DFTAG NT 

106 

Number type: values are float, integer, text, etc. 

DFTAG_MT 

107 

Machine type: specifies IEEE or local computer type. 

w 

DFTAG JD 

300 

Image dimension: gives X and Y size of assoc, image. 

DFTAG LUT 

301 

Lookup table: color lookup table for assoc, image. 

DFTAG Rl 

302 

Raster image: points to actual image data. 

DFTAG RIG 

306 

Raster image group: lists all DDs assoc, with image. 

DFTAG^LD 

307 

LUT dimension: size of color lookup table. 

DFTAG.CFM 

311 

Color format: grayscale, pseudocolor, RGB, HSI, etc. 

(C) 

DFTAG SDD 

701 

SDS dimension: dimension sizes of scientific dataset. 

DFTAG SD 

702 

Scientific data: points to actual scientific dataset. 

DFTAG SDS 

703 

Scientific data scales: Arrays for X,Y,Z locations. 

DFTAG SDL 

704 

SD labels: text describing data and dimensions. 

DFTAG SDU 

705 

SD units: text with units for data and dimensions. 

DFTAG SDF 

706 

SD format, text with format code for displaying data. 

DFTAG SDM 

707 

SD max/min: minimum/maximum valid values for data. 

DFTAGJ3DC 

708 

SD coordinate system: text string defining CS. 

DFTAG„NDG 

720 

Numeric data group: lists ail DDs assoc, with SD, 

W 

DFTAG VG 

1965 

Vgroup: provides general-purpose grouping of DDs, 

DFTAG VH 

1962 

Defines the structure of a Vdata data element. 

DFTAG VS 

1963 

Points to Vdata data element using DFTAG_VH format. 


Table 2: A selection of NCSA defined HDF tag types ; (a) utility tags; (b) raster 
image tags; (c) scientific dataset tags; (d) Vgroup/Vdata tags , 
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in 32-bit signed integers, the maximum 
size of a self-contained HDF file or of a 
single data element is about two gigabytes. 

Every data element (image, array, an* 
notation, and so on) in the HDF file has 
an associated Data Descriptor in the DD 
list. Every DD is 12 bytes long and has 
four fields; see Figure 2. 

The Tag field, which defines the data 
element type, is defined as a 16-bit un¬ 
signed inLeger, which means there are 
65,535 possible types of data dements, 
(0 is not a legal tag number,) The possi¬ 
ble tag values are divided into three 
ranges: Those below 32,768 are assigned 
by NCSA, those above 64,999 are re¬ 
served, and the rest can be user defined. 
Table 2 shows some of the data element 
tag types, 

The HDF library assigns a reference 
number (Ref ), which identifies data ele¬ 
ments with the same Tag, to each data ob¬ 
ject as it is written to die file. The library 
keeps track of the reference numbers that 
have been used in the file and guarantees 


that no two data objects will have the same 
tag and reference number in the same file. 
A Tag/Ref pair can then be used as a 
unique ID (known in HDF terms as a 
"data identifier") to unambiguously iden¬ 
tify particular data elements, 

HDF is both a 
subroutine library 
and a file format 


It is allowable for two objects to carry 
the same tag (such as our two raster im¬ 
ages), or for two objects with different 
tags to carry the same reference number 


(such as a raster image and an SDS). How¬ 
ever, you shouldn't assume that objects 
with the same reference number have any 
special relationship. Instead, HDF uses 
Groups to define relationships. 

Relationships between objects are han¬ 
dled with one of three group objects. The 
Raster Image Group Tag (DFTAG_RIG, 
tag 306) groups objects associated with a 
particular raster image. The Numeric Data 
Group Tag (DFTAGJNDG, tag 720) groups 
data objects associated with a particular 
scientific dataset. The Vgroup tag (DF- 
TAG_VG, tag 1965) supports a generalized 
grouping of all types of data objects, even 
odier Vgroups. 

Group objects contain a List of Tag/Ref 
pairs for each object in that group. From 
that, a program can locate the DD entry 
for the object, which gives the offset and 
size of the object in the file, (An Offset 
field gives the location of the data element 
in bytes from the start of the file, while 
the length field gives the length of the 
data element in bytes.) 


The Future of HDF 


Mike Folk 

S cientists typically use one computer to 
render results for visualization, and an¬ 
other to further analyze and visualize 
the data. Furthermore, they frequently 
share data with colleagues* The need to 
use a mix of computers and transport large 
amounts of data among many different 
computers was an early data management 
problem for numy scientists at the Uni¬ 
versity of Illinois National Center for Su¬ 
percomputing Applications (NCSA)* 

In response. NCSA developed the Hi¬ 
erarchical Data Format (HDF) in 1988. 
NCSA HDF is a portable, self-describing 
data formaL for moving and sharing sci¬ 
entific data in networked, heterogeneous 
computing environments. HDF can store 
several different kinds of data objects, 
including multidimensional arrays, raster 
images, color palettes* and tables* It al¬ 
lows individual scientists to mix and 
group different kinds of data in one file, 
according to their needs. NCSA provides 
a library of APIs for reading and writ¬ 
ing HDF as well as workstation tools for 
visualizing data stored in HDF files. 

Although HDF has evolved to meet 
new requirements, support new kinds of 
scientific data and applications, and op- 


Mike is the HDF Project Manager at 
NCSA. He can be reached at mfolk® 
ncsaMiuc.edu * 


erate effectively in new computing en¬ 
vironments, some important new re¬ 
quirements seriously test die original de¬ 
sign of HDF* Examples of these new 
requirements include: 

• The need to store very large objects (the 
current HDF limit is two gigabytes), 

• The need to store large numbers of ob¬ 
jects (die current limit is 20,000 objects), 

• More general, flexible data models, 

• Performance improvements* 

• Compatibility with object-oriented 
databases and distributed-object tech¬ 
nologies. 

To address these new needs, the NCSA 
IiDF project is working on a prototype 
for the next generation of HDF, code- 
named "HDF 5A Current plans call for 
three fundamental changes in HDF 5: 

Unified data model* The proposed data 
model will support only one datatype: a 
multidimensional array of atomic ele¬ 
ments, The new otiject will have two re¬ 
quired attributes: dimensionality (the 
number and sizes of dimensions) and a 
data type (a definition of the array ele¬ 
ments type). More data types will be sup¬ 
ported, including record structures. Ob¬ 
jects will include optional user-defined 
attributes of the form “parameter = val¬ 
ue,” Users will specify optional physical 
storage schemes for the data, such as 


compressed storage and possibly an in¬ 
dexed structure. For backward compat¬ 
ibility, die new HDF object is designed 
so that all current objects can be defined 
as subtypes of this basic object type. 

New file structure* The new file struc¬ 
ture will support files and objects of any 
size and any number of objects. The in¬ 
ternal structure for describing objects is 
simpler than the current structure and 
should provide faster, easier access to 
objects. 

New I/O library. In planning the 
next-generation HDF library, NCSA de¬ 
velopers hope to exploiL similarities be¬ 
tween HDF and other popular scientific 
data formats by building a system that 
understands a variety of different data 
models and formats. APIs at the top lev¬ 
el allow programs to view data accord¬ 
ing to a variety of different data models. 
These APIs communicate with the mid¬ 
dle layer that interprets their requests in 
terms of a common model. 

The service layer consists of different 
file-format drivers, each of which reads 
from or writes to one file format. Each 
driver has a well-documented interface 
for transferring objects and lists of ob¬ 
jects to the higher arbitration layer. Pos¬ 
sible drivers in the first implementation 
include HDF, BigHDF, netCDF, and FITS. 
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Both Raster Image Groups (RIG) and 
Numeric Data Groups (NDG) consist of 
nothing but a Jisl of data identifiers, each 
four bytes long. Each is restricted to con¬ 
tain only certain types of tags. 

Vgroups do not have this limitation: 
They can contain any collection of Data 
Identifiers, including a Vgroup Data 
Identifier. This lets you construct tree- 
structured organizations similar to disk di¬ 
rectories. 

Example 2 shows an HDF file with 
Vgroups. (1 have cheated a bit in tliis table, 
because the structure of Vgroup data el¬ 
ements is a bit more complicated than 
what is shown.) 

This file has three objects, all listed in 
the DD list. The first VGroup refers to 
the second VGroup, which refers to the 
image size object. As with pointers in 
C, it’s possible to create circular links; 
it is the responsibility of the data pro¬ 
ducer to ensure lhai the structure makes 


HDF Scientific Datasets 

The most important data objects in I IDF 
files are scientific datasets, which are multi¬ 
dimensional arrays of numbers. The ar¬ 
ray can have any dimensionality up to 
32,767, and consist of floating-point or 
integer values. 

The DFSD Interface. An SDS, whether 
created by the DFSD or SD interface, con¬ 
sists of several data objects that are asso¬ 
ciated with die SDS, These data objects in¬ 
clude attributes, numerical scales, die actual 
data, and so on. In die DFSD interface, all 
the DDs of the data objects that are asso¬ 
ciated with a particular SDS are listed in a 
numeric data group (DFTAG_NDG), Some 
older HDF files use die roughly equivalent 
scientific data group (DFIAG_SDG) to col¬ 
lect its members. In either case, only sci¬ 
entific data set tags and a few r utility tags 
arc allowed in a numeric data group. 

The data identifiers siored in the NDG 


sense for the application. 


for this SDS are in Example 3, along with 

Vdata Name: 


Flux Table 




Vdata Class; 


Class.Table 






Field if I 


Field #2 

Field if 3 

Field Types 


inti 6 


float32 

char[5] 

Field Names 


ID 4 


Flux 

Name 

Record #1 


1 


2.34 

CygXl 

Record #2 


2 


-89.43 

HerXl 

Record #13 


3 


0.0023 

CygX3 

Record 44 


4 


1.115 

Vela 

Example 1: Example Vdata table. 

Location Length 

Value Comment 




0 to 3 4 

A N A C i 

'S A A Unique HDF number 


4 to 5 2 

3 

Number ■ 

of DDs 

in block 


6 to 9 4 

0000 

Loc. of 

next 

DD block 



Tag 

Ref 

Offset Length 


10 to 21 12 

300 

001 

46 

4 

DD #1 (Image size) 

22 to 33 12 

1965 

001 

50 

22 

DD If2 (Vgroup til) 

34 to 45 12 

1965 

002 

72 

22 

DD tr3 {Vgroup #2) 


46 to 49 


50 to 71 
72 to 93 


22 

22 


X Size 
300 

Name 
' Vone 1 
' Vtvo' 


Y Size 
200 


Image size 


Tag/Ref 

1965/002 First VGroup 
300/001 Second VGroup 


Example 2: Organization of an HDFfile with two Vgroups. 


Tag Name 

Value(s) 



Comme nt s 

DFTAG_DIL 

'Neutron Star 

Accretion Simulation' 

Title 

DFTAG.SDD 

2 Dimensions: 

4 10 


Number and size 
of dimensions 


Float32 

Float32 

Float32 

Number types 

DFTAG„SDS 


-1,67.♦ 

,1.70 30.75.,.32.11 

Numerical scales 

DFTAG_SDL 

'Density 1 
'gm/cm* 3 1 

'Time 1 

'Radius' 

Labels 

DFTAG.SDU 

' see T 

t km" 

Units 

DFTAG-SDF 

F7.4 

F5,2 

F5.2 

Numerical Formats 

DFTAG_SD 

'0.5372,0,5872, ... 

,1.4319,1,4352' 

Actual data 


Example 3: Data objects for an example SDS. 
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the information pointed to by lhe data 
identifiers. I ignore in ibis list the actual 
format of the data identifiers, and con¬ 
centrate instead on the information that 
they point to. The Ref, Offset , and Length 
fields for each DD are not shown. 

Hie DFTAG_SDD tag says that the data 
set consists of two dimensions, sized four 
and ten elements, respectively, where the 
data and both scales consist of floating¬ 
point numbers. The numerical scales are 
set by DFTAGJSDS, with a four-element 
array for the first dimension and a ten- 
element array for the second dimension. 

The SD Interface. The basic ideas be¬ 
hind the SD version of the SDS are the 
same as those in the DFSD SDS, but some 
changes were needed in order to achieve 
the major design goal compatibility with 
netCDF. This driver gave rise to three dis¬ 
tinct differences between the two imple¬ 
mentations. 

First, rather than using a specialized tag 
such as DFTAG_NDG to group its ele¬ 
ments, the SD interface uses a general- 
purpose Vgroup, Using a Vgroup allows 
for more flexibility in die organization of 
the SDS widiout a complete redesign. Sec¬ 
ond, the handling of dimensions has 
changed considerably, in that they can 
now have their own attributes. Third, the 
SD interface allows the use of user- 
defined attributes, not just the ones that 
are predefined. What follows is a list of 
the different styles of SDS, in order of their 
adoption into Lhe HDF library, 

* Scientific Data Groups (SDG), an obso¬ 
lete form of SDS that only supports 32- 
bit floating-point arrays. 

* Numeric Data Groups (NDG), the disk 
format for NDG groups (lag value = 
DFTAG_NDG). 

* Multifile SDS, the SD interface (after the 
prefix of the subroutine calls) uses 
Vgroups, Vdatas, and many of die parts 
of die NDG to create structures dial are 
compatible widi netCDF data structures. 

These subroutines allow several HDF flies 
to remain open simultaneously, something 
not passible with die earlier libraries, Tills 
interface can also assign arbitrary attributes 
to SDS. tn the documentation they are also 
referred to as “New Style” SDS, or (con¬ 
fusingly) just SDS. The multifile interface 
will recognize all older forms of the SDS, 

HDF Vdata 

1 /data tallies requires the use of two tags: 
DFTAG_VH for naming the columns of 
values; and DFTAG_VS for pointing to the 
actual Vdata data itself. There Is no ex¬ 
plicit grouping needed for Vdata; related 
DFTAG_VS and DFTAG_VH records have 
the same reference number. This is the 
only case where die HDF libraries use the 
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reference numbers explicitly to associate 
two data elements. 

Example 4 shows the Vdata tags need¬ 
ed for Example 1. Again, we focus on die 


information pointed to by the tags and ig¬ 
nore the actual binary formats. In particu¬ 
lar, we have assigned names to the various 
fields defined in the data record pointed 


to by DFTAG_VH, These Field names are 
also used in the HDF documentation, 

The DFTAG_VH data element defines 
a Vdata with three fields (columns) and 
four Vdata records (rows). The values in 
these fields are defined as short (two- 
byte) integer, four-byte floating point, 
and five-character ASCII text, respectively. 
In addition, each field has an ASCII text 
name. Note how the entire Vdata defini¬ 
tion can be named (<name>X This is un¬ 
usual; most HDF data objects require a 
separate tag (DETAG_DIL) to get a name. 

The data pointed to by the DFTAG_VS 
tag is stoned packed. In this example, each 
record requires 2+4+5=11 bytes, and the 
complete table takes 11x4=44 bytes. 

HDF Extended Tags 

The organizational features of HDF are so 
powerhil that it is usually possible to store 
a complete data product granule in a sin¬ 
gle HDF file. There are, however, a couple 
of problems with very large single-file data 
products. 

The first problem is that I IDF files are 
limited to two gigabytes in size. The sec¬ 
ond problem is that HDF data elements 
were not initially designed to be append- 
able. Normally, data elements are required 
to be in contiguous storage and they are 
packed right next to each other Therefore, 
to make a data dement bigger, it needs to 
be copied to Lhe end of the file where there 
is room to grow, leaving a gap in die file. 

To get around these limitations, NCSA 
has defined “extended tags,’* which allow 
data element to be spread among multiple 
locations in the HDF file or even in a com¬ 
pletely separate file 1 , A data clement corre¬ 
sponding to any NCSA defined tag value 
can (in principle) be converted to an ex¬ 
tended tag, although this functionality is 
currently only supplied for SDS and Vdata. 

An extended tag DD does not point di¬ 
rectly to the data, as do normal tags. It in¬ 
stead points to a data object defining 
where the data is and how it is stored. Tills 
data object may point to the beginning of 
a linked list of data blocks that contain the 
entire data record. This way, a data record 
can be lengthened just by adding an ad¬ 
ditional data block; the entire HDF file does 
not need to be rewritten. 

Alternatively, the extended tag record 
could define the data element as being 
stored in an external element in another 
disk file. This feature (also found in CDF), 
lets users get around the two-gigabyte to¬ 
tal file size limitation. The reference to die 
external file is explicit: That file must exist 
exactly as specified by directory and name. 
This is not compatible with moving files ex¬ 
tensively between disparate computers, 

DDJ 

(Listings begin on page 109-) 


Tag Natne 

Field Niitno 


Value(s) 


Comments 

DFTAGjm 

<nvert> 


4 


Number of records 


<isize> 


11 


Row width in bytea 


<nfields> 


3 


Number of fields 


<type> 

inti 6 

float32 

char[5] 

Field number types 


<isize> 

2 

4 

5 

Field size in bytes 


<offset> 

0 

2 

6 

Byte offset of field 


<fldnmien> 

5 

4 

4 

Length of field Name 


<fldnm> 

'ID.No' 

'Flux' 

r Name' 

Field names 


<nameleu> 


19 


Length of Vdata name 


<name> 

'Vdata Example Table' 

Vdata name 

DFTAG.VS 


1*2* 34, f CygXl f ,.. 

./Vela' 

Actual data 


Exa mple 4: Vdata tags for Example 1. 
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Asynchronous image and 
applet loading. 

Persistence - serializes 
current state. 

Background images. 
Document history. 
Lightweight (only 120kb). 
Fast rendering! 
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Microsoft Windows CE 2.0 is 
the greatest embedded operating 
system in the world! You’ll get to leverage 
the billions of benefits supplied by a supported, 
standardized embedded platform. 


Marketing Truth 


Working with the componentized operating 
system is a snap with the award-winning 
visual development environment! 

Wow! Talk about a giant 
leap for mankind! 


Windows CE 2.0 
is a new operating 
system for my 
embedded applications. 
I can select only the parts 
I need. Lots of hardware and 
software companies support it 







The Microsoft Windows CE Embedded Toolkit for 
Visual C++ 5.0 supports the Win32 API and includes 
an integrated development environment. 


This is good. 
























The Super-Fast, Extremely Vast 
Personal Storage Drive, 



$299 external drive 

1 gig Jaz* cartridges for as low as $89,95* 


and a whole mess of GOO* into a 

Zf INCH SQUARE.” 

' 

BEN WEISS 

Imaging Scientist 
Metatreations 


Each cartridge has a huge 


one gigabyte capacity 

1-Step 71 " backup software protects 
as much as Z gigs {com pressed0 


Access time 

15.5ms read/17.5 ms write 




Average seek time 

10ms read/12ms write 

Maximum sustained transfer rate 

6.62MB/sec max 


v h.v3i 
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t Assuming 2:1 compression ratio. Actual compression will 
vary with file and hardware configuration. 

IftotolMKl will vary when using 1 GB cartridges. 

©199a fomEga Corporation. Ioiiwga. the fomega topy and Ja 7 ire leistered 
Irademajis, am t "The Super-Fast, Extremely Vast PensoFtal Storage Drive, - 
■Because It's Your Sturt, - and l-Step an; trademarks of Iomega Corporation. 
■Kars WsrtO Soap and Wi fewer GGG are trademark wf MrtaCreations 
Inc. AU other trademarks are tte property of their respective Elders, fee 
vkews expressed harein are the views of tlie endorse* and are not the vifwi 
erf. and do not cunstfeutf in endan»,niirt by, any person or firm for whom 
the endorser ha* provided service*, "When purchased in multi-parks, Prices 
listed are estimated street prices. Actual prices may vary, PCS capacity 
when? 1GB-1 triUsoci bytes. TTie capacity reported by your operating 
system may differ, depinkfing an foe operating: system revolting utility. 


They call Ben Weiss an "Imaging Scientist," Sounds cool, but 
hes much more than that. He's a top U.S. scholar, a pianist, a 
surfer, and one of MetaCreations' key programmers behind Kai's 
Photo Soap “ and the world's #1 graphics title, Kai's Power GOO.'” 
And whether Ben's backing up gigantic files, transporting or 
mirroring his work, he needs space. Big space. Space like the 
high-capacity Jaz" drive. 

So while Ben's work is big enough to show up on computers 
around the world, thanks to handy, removable Jaz disks, its small 
enough to take with him to the beach. 

Check out more of Bens story at www.4inchsguare.com/ddj. 

lhe his"-P e ' forma " Ce sta " da,d in removal,, e 

„•,* one million |az 1GB drives already out 

tvr0 














Two times the capacity and over 
30% faster than the 1 gig Jaz drive. 




$649 external drive 

2 gig Jaz cartridges for as Low as $149*95*’ 

Each cartridge has 
two monster gigabytes of capacity 

2GB is read/write compatible with IGBtr 

1-Step™ backup software can protect 
up to 4 gigs (compressed*) 

Access time 

15.5ms read/17,5ms write 

Average seek time 

10ms read/12ms write 

Maximum sustained transfer rate 

SJMB/sec. max 


For information about connecting 
your Jaz drive to a Mac or PC, 
see your reseller or visit us at 

www.iomega.com 
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Algorithms for 
High-Precision Finite 

Differences 


Improving the accuracy 
of numerical methods 

Michael Herstine 

I n developing applications for science 
and engineering, code reuse is just as 
desirable as in other areas of software 
development, but ran often have obsta¬ 
cles peculiar to applied mathematics In 
particular, it is common to use the results 
of legacy code as input to new, higher- 
level algorithms. In practice, this amounts 
to coding numerical methods where a sin¬ 
gle function evaluation (from the stand¬ 
point of the algorithm) is, in fact, a run of 
large amounts of code. Often, that code is 
single-precision legacy Fortran 77, 66, or 
even Fortran IV. 

In such cases, analytic derivatives are not 
available, and must be estimated. Howev¬ 
er, the level of error incurred must be con¬ 
sidered. Unfortunately, die subject of finite- 
dilTercncc estimates of derivatives is largely 
absent from the applied mathematics liter¬ 
ature, Researchers are typically interested 
in issues such as convergence rates, radii 
of convergence, and breadth of applica¬ 
bility of methods. Little attention is paid to 
die accuracy of results compared to avail¬ 
able precision. Numetrical experiments are 
typically performed in double precision, 


Michael is a C++ developer at Northern 
Research and Engineering Carp. He can 
he reached at mgh%woridstd,com. 


and results reported lo between five and 
seven significant figures. When derivatives 
are needed, they are usually obtained via 
forward differencing (to keep the total 
number of function evaluations low) with 
some fixed step size. The availability of 16 
figures of accuracy makes die error asso¬ 
ciated with this method acceptable. 

In tills industry, however, we often don't 
have the luxury of throwing away half of 



our significant digits, especially when forced 
to work with single-precision code. In this 
article, 111 implement techniques for deal¬ 
ing widi this situation, and present the re¬ 
sults of numerical experiments. 

1 encountered die particular develop¬ 
ment problem that prompted this article 
while working on a software package that 
predicts the performance of compressors. 
Typically, users adjust various parameters 
(such as RPM, amount of fluid moving 
through die compressor, or die physical 


dimensions of the part) in an attempt ro 
achieve peak perfomiance while still meet¬ 
ing design criteria for weight, overall size, 
and so on. 

In an attempt to automate some of the 
simpler problems that the designer was 
solving by hand, we hooked the package 
up to a constrained optimizer. The idea 
was to have the optimizer vary selected 
inputs to the package to maximize per¬ 
formance while still satisfying various con¬ 
straints. One of the challenges we need¬ 
ed to overcome was die tack of analytic 
derivatives. The application was already 
fairly complex and solved many of its sub- 
problems iteratively. The code would need 
to be treated as a black box, forcing us to 
approximate die derivatives numerically. 
Further complicating 11 lings, the core anal¬ 
ysis routines were made up of tens of thou¬ 
sands of lines of Fortran 77 written in sin¬ 
gle precision. Thai is, we were be ginn ing 
with only about seven digits of accuracy, 
some of which were lost during the anal¬ 
ysis. The additional loss of accuracy in¬ 
curred by naive use of forward differences 
essentially prevented ihe optimizer from 
converging to any real accuracy. 

Theoretical Basis 

111 ere are two primary sources of error in 
all finite-difference estimates of derivatives: 

• Truncation error, stemming from our 

having thrown away higher order terms 

in the Taylor series. 

• Condition error, stemming from die finite- 

precision available on a computer. 

I’ll turn first to die truncation error in 
forward differences. (For more information 
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ONLY NuMEGA TrueTime™ PROVIDES SEAMLESS 
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(continued from page 52) 
an tliis topic, see Practical Optimization , 
by Philip Gill et al, Academic Press, 1981, 
ISBN 0-1-228-3952-8.) 

For more information on Taylor’s the¬ 
orem (Example 1(a)), turn to any standard 
work on calculus. Solving for df/dx leads 
immediately to the forward difference for¬ 
mula in Example 1(b) for estimating the 
first derivative. The second term on the 
right, which is dropped in making the es¬ 
timate, is referred to as the "truncation er¬ 
ror” (because we are truncating die Tay¬ 
lor series for/). If the calculations were 
being performed to infinite precision, dial 
would be the end of the story— small¬ 
er step sizes give smaller errors. Of 
course, that’s not the case. Since /is be¬ 


ing evaluated on a computer to a fixed 
precision, there is some approximation 
error. Let g(x) be the computed value of 
f(x). That is, g(x)=fix)+eps(x) (so that 
eps(x) is the lloating-point error in com¬ 
puting/ at x on a computer). If/is a sim¬ 
ple function, eps would typically be 
bounded by [fix) \ eps(M), where eps(M) 
is the machine precision (2-^~10 7 for 
floats, 2^10^ for doubles). However, for 
more complex functions, eps(x) may be 
considerably larger. Taking this into ac¬ 
count, Example 1(c) is die actual estimate. 
Tlie last term, the error term, is referred 
to as “condition error,” 

You can now see how things get in¬ 
teresting . Truncation error gets smaller 
as the step size decreases, but the con¬ 


dition error grows. Suppose you can 
bound 


by some constant, say M (at least near xX 
Suppose too, that you can bound \eps(x) \ 
by another constant, E f near x I'll denote 
(g(x+h)— g(x))/h by <p(x) (in other words, 
0fjc)will be die computed estimate of die 
derivative); then | $(xXdf/dx(x) \<Mfy2+2Efo. 

Clearly then, there exists an optimal val¬ 
ue for h\ the first term on the right in¬ 
creases with h T while the second term de¬ 
creases with it. To find that optimum, you 
differentiate die right side with respect to 
h to obtain Example 1(d). If you set this 
equal to zero and solve for b, you get Ex¬ 
ample 1(e), 

When the step size is to be fixed (this is 
usually done for performance reasons), the 
square root of the machine precision is of¬ 
ten used. An identical analysis yields the 
following step sizes for other formulas: 

* Example 2(a) shows die central differ¬ 
ence approximation to the first deriva¬ 
tive, where in this case M Is an upper 
bound on df/dx 1 *, 

* Example 2(b) is the central difference 
approximation to the second derivative, 
where in this case M is an upper bound 
on df/dx\ 

If you do have analytic derivatives avail¬ 
able, die aforementioned observations can 
be turned around to estimate E f the max¬ 
imal error in evaluating/ Recall die bound 
on the error in forward differencing: 
\§(x)—df/dxix) | <Mb/2+2E/b. Notice that 
if b is small enough, the error bound 
will be dominated by the second term 
on the right side. Under this assump¬ 
tion, you have \<p(x)-df/dx(x)\<2E/b, 
or E=(){ | <p(x)-df/dx(x) \h). 

In other words, assume you know the 
correct value of df/dxix). You can inten¬ 
tionally induce error in the forward dif¬ 
ference approximation to dj/dx(x) by in¬ 
tentionally choosing a very small step size. 
This will cause the condition error to lial- 
foon up, and the truncation error to die 
off, so you can drop the first term in our 
upper bound on 1 0j-df/dx(x) \ . Then, 
since df/dx(x) is known, you can solve 
for £, and so estimate how much floating¬ 
point error is creeping into your function 
evaluations. 

For this method to give realistic esti¬ 
mates in practice, a few subtle points need 
to be considered The first is that die quan¬ 
tity 2 E/h is only an upper bound on the 
finite-difference error. There is no guar¬ 
antee that the actual emir for a given step 
size will hit this upper bound. In fact, it 
is often the case that the actual error in 
evaluating a function varies widely at 
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Introducing Multi-Edit 8,0. The only editing 
tool you need is faster and more powerful than ever 
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Multi-Edit is the only editor you'll need because ids the only 
editor designed to work the way you work. Multi-Edit watches as 
you type, creating custom templates for you (this feature alone can 
cut your keystrokes in half). And whatever you code in, from C++, 
Modula-2, Fortran, HTML, Java, Delphi and more, the language support you 
need is there for you, RAD environment integration packages and WebLair (a 
complete web editor) are all part of the Multi-Edit environment. In fact, if Multi- 
Edit doesn't already support the language or compiler you use, just let us know and well 
add it at no charge. It r s that simple. 

WHAT'S NEW TO 8,0? 


j * Full 32-bit performance 
j * WebLair - HTML f website management 
^ environment, now integrated in Multi-Edit 
j * Integral FTP support 
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* search/compile results and more at your fin- 
\ gertips 
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WWW.MULTIEDIT.COM 
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■ Automatic window sorting 

1 Multi keystroke and language specific key 
assignments 

■ Code completion 

■ Project manager 
‘Plus much more... 


Outside llit; US S Cjju tunlacl: 
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backed by a 30-day uncundiEiunal guarantee. Ollier piu lI uc ts 
trademark* of their respective publishers. 


54 


Dr Dobb’sJournal, May 1998 



















adjacent points. The second concerns the 
choice of step size. If it is taken to be too 
small, the forward difference w'ill be zero, 
and any information on the error will Lave 
been lost. If it is taken to be too large, the 
assumption that the error bound is dom¬ 
inated by condition error will no longer 
be valid. Lastly, what if analytic deriva¬ 
tives are not available? These concerns are 
addressed in algorithms that implement 
these ideas. 

Algorithms 

I will now present algorithms for high- 
precision estimates of first derivatives 
through forward differencing (see the 
aforementioned Practical Optimization). 
The idea is to get a reasonable estimate 
of M (the upper bound on /’s second 
derivative) through straightforward cen¬ 
tral difference estimates. The step size 
for forward differencing is then chosen 
to be Example 3(a). 

The bulk of the work, ihen, is to de¬ 
termine a step size giving a suitable es¬ 
timate of M. 1 assume that the second 
derivative of/is changing slowly enough 
that its value at x is an acceptable ap¬ 
proximation to At. A sequence of trial val¬ 
ues for thaL step size is generated, and 
at each point, the relative condition er¬ 
ror is calculated as in Example 3(b). Here, 
E is our bound on the evaluation error 
in f h is the trial step size, and 0 is the 
estimate of /’s second derivative, cond- 
Erritb) is a measure of the size of the 
condition error only (that is, neglecting 
truncation error) in estimating the sec¬ 
ond derivative, relative to the size of the 
second derivative. Likewise, Lhe relative 
condition errors of the forward and back¬ 
ward difference estimates of f's first 
derivative are found from; 


any monomial of degree greater than 1). 
The “Is" in both numerator and denomi¬ 
nator guard against the effects of small 
values for either/or x If this assumption 
holds, then, from Example 1(e), the opti¬ 
mal interval is Example 3(d). 

However, at a step size of b , cond - 
Err(<P) will be of order one* so we 
choose as our initial estimate of b the 
quantity 10/) to produce a condErr (0) of 
0.01. Of course, it Is entirely possible that 
the assumption on the beliavior of/ s sec¬ 
ond derivative is incorrect. If the initial es¬ 
timate of 0 produces a relative condition 
error that is too small, you simply decrease 
b y and if comlErtidP) is too large, b is in¬ 
creased This is repeated until an accept¬ 
able value is found 


Notice that tilts iteration could fail to 
ever produce an acceptable value of b if, 
say, / is constant. For this reason, we set 
an upper limit on the number of iterations, 
k max - For diagnostic purposes on failure, 
we also record whether or not we ever 
found a value of b that was acceptable for 
estimating the first derivative. This infor¬ 
mation is encoded into the variable b r 
which is initialized to -1, and set to the 
step size h the first time we find an b for 
which both and 0 h have relative con¬ 
dition errors of less than 0.1. (In the fol¬ 
lowing algorithms, ^denotes the forward 
difference approximation to the deriva¬ 
tive, 0 b the backward difference approxi¬ 
mation, and the central difference ap¬ 
proximation.) 


Some combinations 
produce real 
synergy. 





■sualC 


Our Sync Technology 
gives you smooth interaction. 


condErr ((jj) = 


2E 


condErr (0 b ) = 


2E 

*10*1 


For a value of h to be considered ac¬ 
ceptable, condErr(<P) mast be less than 0.1 
(guaranteeing at least one cornea signifi¬ 
cant digit, at least with respect to the con¬ 
dition error). To carefully assess the accu¬ 
racy of 0 some estimate of the truncation 
error is needed. However, it is inconvenient 
to do tliis, and so instead, die relative con¬ 
dition error is requited to be at least 0.001, 
since the truncation error generally rises as 
the condition error falls. To obtain an ini¬ 
tial estimate of b, the assumption shown 
in Example 3(c) is made. 

In other words, it is assumed that /'s 
second derivative is proportional to/ and 
inversely to the square of its argument 
(note tliat this assumption is satisfied by 


Codewrighf s Sync Technology 
simplifies using your stand-alone editor 
with your integrated environment, 
whether it’s Microsoft Visual C++, or 
one of Borland’s IDEs. 

Sync takes care of the details of saving 
and reloading files when you switch 
between applications. The files you 
edit in one are automatically loaded In 
the other. Sync gives you external 
access to wizards, and menus items in 
your IDE, for added convenience. 
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(a) 

f(x+h)*f(x)+^(x)h+ 1 (-g /,2 w tiere x<;<x+h 

<b> 

aff /wl- t(x+h)-1(x) 1 d 2 t /r 

dx {) ~ h 2 dx* <U 

(c) 

g(x+h)-g(x) _ 
h 


f(x+h) + eps (x+h )-f(x) -eps (x) 
h 


f(x+h)-f(x)+ eps(x+h)-eps(x) 
h b 

» 

±(M)-2Eh2 

(e) 



Example 1: (a) Taylor s theorem; (b) solving for df/dx; (c) actual estimate; (d) 
computed estimate; (e) solving for h. 



Example 2: fa) Difference approximation to the first derivative; (h) difference 
approximation to the second derivative. 


Algorithm #1 

J. Initialization. Set where h 

=2( 1 + \x | ) VfMl+1 f(x) |). Set k=Q, 
Evaluate $f(h), <p h (h) f 0(h), and their 
relative condition errors. Set h=-l, 

2. Evaluate the initial estimate. If 
max(condErr(f)j), condErr(<p b )<QA y 
set h=h. If 0.001 <condErr( <3>KG.1, go 
to step 5. If condErr(0)< 0.001, 
go to step 4, Otherwise, continue at 
step 3. 

3- Relative condition error too large. 

a. Set k^krl, and h=10/7. 

b. Recompute the finite differences and 
their relative condition errors. 

c. If b=- 1, and rmixfarndEni<$>- ) f cond- 
Err($ h ))< 0.1, set b = h. 

d. ff condErr(0)< 0.1* go to step 5. 

e. If k=k max . y go to step 6. 

f. Go back to step 3a, 

4. Relative condition error too small. 

a. Set £=jfe+l f and h=hf\ 0. 

b. Recompute the finite differences 
and their relative condition 
errors. 

c. W h=-h and maxicondErK0), cond- 
En(0 h })< 0,1, set b~h. 

d. If condErr(0)>OA, go to step 5. 

e. If k=k Max , go to step 6. 

f. Go hack lo step 4a, 

5. Success. Set h ( ^=2^E/'\0\ t die error es¬ 
timate is 0. 
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6- Failure- 

a. If £*=]., then/appears to be nearly 
constant- Set b 0 p t =b, and set <pj = 0; 
the error estimate is zero. 

b. If by£- 1 and condErr1<P)> 0.1, then 
/appears to be linear or odd. Set 
A 0/ >r6 ff , <P=0, and set the error 
estimate as in step 5. 

c. Otherwise, the second derivative 
appears to be changing too rapid¬ 
ly- Set h u ^h and set the error es¬ 
timate as in step 5. 

Algorithm #1 assumes the availability of 
E. Often this is not a problem; E may be 
estimated by E- \f\(machine precision). 
In other cases, however, this may not be 
sufficient. If analytic derivatives are avail¬ 
able, then the method outlined here could 
lie used to compute eps(x). Of course, if 
you have analytic derivatives, then you 
don't care about Algorithm #1 (although 
[ suppose that if derivative evaluation were 
expensive enough, you might do it once 
to gel eps, and then use Algorithm #1), In 
general, though, the problem is that ini¬ 
tially, you have neither a reasonable esti¬ 
mate of the derivative or of E. Luckily, Fve 
found that, in practice, an order of mag¬ 
nitude estimate of E is sufficient for Al¬ 
gorithm *1. Consequently, Algorithm *2 
may be used to perform a high-precision 
estimate of die derivative when only func¬ 
tion evaluations are possible. 

Algorithm #2 

1. Use some Fixed step length to estimate 
df/dx through forward differences. 

2. Use this result to estimate E (see Algo¬ 
rithm #3). 

3. Use the result of step 2 in Algorithm #1, 


Previously, 1 mentioned that there were 
a few subtle points to keep in mind when 
trying to back out E. Algorithm #1 takes 


(a) 


ff» 

condErr(<l>)=4 — 

(c) 


«9 

h=2(U\x\)^j — ^ — 
'l(1+if(x)\) 


Example J; (a) Step size for forward 
differencing; (h) condition error in 
estimating the second derivative is 
calculated; (c) obtaining art initial 
estimate of h; (d) optimal interval 


them into account. The idea is to find the 
smallest step size that produces a change 
in the function value, and use that for a for¬ 
ward difference estimate. Any step size less 
than that gives no useful information, since 
the resulting Unite-difference estimate will 
always be zero, independent of the func¬ 
tion, On die other hand, the smaller the step 
liecomes, the more the condition error term 
dominates the error bound. This leads you 
to use die smallest step size possible, 

Tlie algorithm proceeds in three distinct 
phases. The first attempts to bracket the 
least significant step; A denotes the low¬ 
er bound and B the upper. The second 
phase uses a simple bisection algorithm 
to find the least-significant step within a 
user-specified relative tolerance, relTol 


(tliis dues not have to be unduly tight; in 
practice; I’ve found that a value of 0-1 
works well). The last phase takes this ap¬ 
proximation to tlie least-significant step 
and uses it to compute the forward, back¬ 
ward. and central difference approxima¬ 
tions to the first derivative. These am then 
compared to the “known* derivative val¬ 
ue, and (for each approximation) E is com¬ 
puted. The maximum value is returned. 

Algorithm #3 

1. Bracketing Phase. Break die procedure 
into cases: If both f(x) and df/chdx) are 
nonzero, go to step 2. If dffdx(x) is 
nonzero, but f( x>“0, go to step 3, and 
if bodi f(x) and df/dxix) are zero, go 
to step 4. 



Some combinations 
produce 
synergy. 
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f(x,y)-( 2sin(x)-cos(y)-hcos(x) 

\ -2-cos{x)+2sin(y)+3cos(y) 


Figure 1: Test 1: The trigonometric 
function. 

2. Bracketing Phase, Case L A reasonable 
guess for h might seem to be Jjc \eps(M) f 
which would be the least-significant 
change you could possible make in x. 
However, when x is small, this can lead 
to absurdly small guesses for b t which 
slow the algorithm considerably, So T 
when x is small, we instead reason tiiat 
JfxhepsiM) [fix) |~/f x+h)~flx)+d/dx1x)h 
when h Ls die smallest significant step. 
So, if |x | >1, let A=\x \ eps(M) else let 
A=eps(M) [fix) \\ | df/dxtx) |. Let B-1QA, 

a. Ensure that A is a Lower Bound. 
While f(x+A)rf(x). let B^A t A=A/ 10. 

h. Ensure that B is an Upper Bound* 
While fix+B.^fCx), let A=B. B=\i)B. 
c, Go to step 5. 

3, Bracketing Phase, Case 2. Since fix )=0 t 
function values near a: are very small 
(near underflow), and the smallest- 
significant step she may lie hard to find. 
Instead, we simply set A to 0. 

a* Ensure that B is an Upper Bound 
While ftx+BhJlxX let A=B, B=\QB. 

b. Go to step 5. 

4* Bracketing Phase, Case 3* Since 
df/dx(x)-Q t the function may even be 
constant, so we must set some maxi¬ 
mum number of guesses, say k inijX , let 
^“0, &-0. 

a. Ensure that B is an Upper Bound. 
While jlx+B)= 0, and k<k mitxt let 
A=B, B= l()B, jHh-1* 

b. If k=k imi _ x , then Lhe function appears 
to lie constant—return an error code* 

c. Go to step 5. 


Table 1: Errors incurred in Test l 



Figure 2: A log-log plot of Test 1 s step 
sizes against errors. 


5- Search Phase, Carry out a binary search 
until A and B agree to relToL 

6, Estimation Phase, Compute (B) t 
§ b (Bh $JB). Compute ertif)=\£(B)- 
dffchfxl enibh\$ b (B)-df/cbdx)\ and 
err(c)— 2 \ $ C (B)- dfZdx(x) [. 

7. Return max(err(f) t eni b), err(cj). 

Numerical Experiments 

fve implemented these algorithms in C++ 
(available electronically; see “Resource 
Center," page 3X and applied them to 
rwn test functions that show the algo¬ 
rithms in their best and worst light, in 
each case, the derivative was first esti¬ 
mated cheaply (comparatively) in the fol¬ 
lowing way: 

h Estimate E by 0+ [/far) | hps(M). 

2, Set h= j^| V(7i7, 

3* Use h to compute 

That estimate w T as then used in Algo¬ 
rithm #2 to produce a more careful ap¬ 
proximation. 

Figure 1 presents the first test function. 
Table I lists the errors incurred by for¬ 
ward difference approximations to the first 
partial derivative with respect to x at the 
point (0,0) (tine exact value is—2), Figure 


Step Size 

Estimated 

Derivative 

LoglO(step) 

Log 10 (error) 

1.G0OOOOE-O7 

1.78814 

-7 

-0.673951032 

1.000000E-06 

2.02656 


-1.575771929 

1.000000E-05 

2.00271 

-5 

-2,567030709 

3.452670E-D4 

1.99979 

-3.461844929 

-3.677780705 

6,4O6720E-O4 

1.99964 

-3.193364256 

-3.443697499 

1.000000E-03 

1.99947 

“3 

-3.27572413 

1000000E-02 

1.99497 

-2 

-2.298432015 

1,000000E-01 

1 94671 

-1 

-1.27335428 

1.000000E+00 

1.22324 

0 

-0.109713147 


Step Size 

Estimated 

Derivative 

LoglQ(step) 

Log10(error) 

1.0000QGE—04 

1.144410E-01 

-4 

-2.121667819 

1.000000E—03 

1.220700E-01 

-3 

-4,140861703 

2.444270E“03 

1.242680E-01 

-2.611850823 

-2.643916751 

1.000000E—02 

1.319410E-01 

”2 

-2.002469457 

1.035800E-D1 

2.255080E-01 

-0.984724093 

-0,985016433 

1.000000E+00 

1.121400E+00 

0 

-0.000259655 


Table 2: Errors incurred in Test 2. 


((xHx-IOOp+IO^X-SOQ ) 3 


Figure S' Test 2: Example 8. 5 from 
Practical Optimizations. 



Figure 4; A log dog plot of Test 2*s step 
sizes against errors. 

2 shows a log-Jog plot of the step sizes 
against the errors. 

Clearly, the optimal step length is near 
3.45266c-4, which happens to lie the 
square root of machine precision. So T in 
this case, the standard step length does 
quite well, giving an estimate of about 
T99979- Algorithm #1 yields a step size 
of 6,40672e- 4, and yields an estimate of 
about 1.99964 1 comparable in accuracy to 
tile standard method. 

Figure 3 presents the second test func¬ 
tion (based on Example 8.5 in Practical 
Optimization) Table 2 lists the errors in¬ 
curred in estimating the derivative at 
a- 100,001 (the exact value is about 
,1219977), Figure 4 displays the same log- 
log plot as in Figure 2. In this case, the 
quick-and-dirty approach yields a step 
length of about .10358, and an estimated 
derivative of .225505 (or no correct sig¬ 
nificant digits). Algorithm #1, on the oili¬ 
er hand, yields a step length of about 
2*444276-3, and an estimated derivative 
of about ,124268 (or two correct signifi¬ 
cant digits). 

Conclusion 

The algorithms presented here can cer¬ 
tainly improve the accuracy of numerical 
methods, though at the cost ol significantly 
more function evaluations. They can prin¬ 
cipally be applied to practical situations 
where obtaining greater accuracy from die 
function evaluations is difficult or impos¬ 
sible. Another application of estimates of 
function evaluation error lies in appropri¬ 
ately tolerancing iterative schemes. Again, 
this generally is not covered in the aca¬ 
demic literature, where convergence cri¬ 
teria can l>e adjusted as needed. Howev¬ 
er in engineering software development, 
solvers must work reliably over a wide 
range of problems, and a tolerancing 
scheme dial adapts to Lhe available accu¬ 
racy can help. 

DDJ 
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The Pentium 
FOOF Bug 


Workarounds for 
a nasty problem 

Robert R. Collins 

L ast fall, a message warning users of a 
bug in the Pentium/Pentium MMX was 
anonymously posted on lire comp 
.os.lmux .advocacy Internet newsgroup. 
According to the message, the bug could 
completely kx:k up the computer from any 
operating mode in any operating system. 
At first glance, the ordinary news reader 
might say “so what.” After all, many users 
are accustomed to Windows 3-1/95 regu¬ 
larly locking up. But the placement of the 
bug on comp,os.Iinux.advocacy was cal¬ 
culated and intentional The readers on tliat 
newsgroup knew exactly what the bug 
meant— an ordinary user or saboteur could 
unleash a program to bung down network 
servers that form the backbone of our mod¬ 
ern networking community. Internet ser¬ 
vice providers (ISPs), web hosts, govern¬ 
ment agencies, and computer departments 
at universities were petrified of the poten¬ 
tial damage that this bug could do, 

Enter the FOOF Bug 

When any x86 processor from the 80186 
and beyond encounters an invalid in¬ 
struction, the processor is supposed to gen¬ 
erate an invalid opcode exception. In In¬ 
tel vernacular, the undefined opcode 
exception is known as a This han¬ 

dler usually signals an error condition and 


Robert is an independent consultant on 
the x86 architecture . He can he reached 
at rcoUim@x86.org. 


terminates the errant program. When this 
mechanism works, the errant program 
ean T t hann the computer system. Should 
this mechanism fail, however, the errant 
program can bring down the entire com¬ 
puter If the computer is a network serv¬ 
er or ISP, then the errant program can 
bring down the entire network. 

Thafs what can, in fart, happen when 
the Pentium encounters the “FOOF bug, 
which maps to a LOCK CMPXCHG8B EAX 



instruction, CMPXCHGKB compares 64-bit 
memory contents with the contents in EDX 
and FAX. One of the operands must he 
memory, and the other (implied) operand 
Ls EDX:EAX, It is possible to construct an 
instruction encoding that doesn't map to 
a memory operand. Since the nonmemo¬ 
ry form of tills instruction is invalid, a com¬ 
piler or assembler will not generate this 
code. Instead, assembly-language pro¬ 
grammers must construct it by hand. 

Such an illegal encoding should gener¬ 
ate the requisite #UD. As you'd expect, a 
CMPXCHG8B EAX instruction generates 
a #UD. However, when this illegal en¬ 


coding is prepended with a LOCK prefix, 
the processor fails to work correctly. 

Using the LOCK prefix on this form of 
CMPXCHG8B is illegal in and of itself. LOCK 
prefixes are only allowed on memory- 
based read-modify-write instructions. 
Hence a LOCK prefix on the register-based 
CMPXCHG8B EAX instruction should also 
generate an invalid opcode exception. 

Instead, the Pentium locks up and 
freezes the entire computer when it en¬ 
counters this instruction. This bug is es¬ 
pecially nasty, because any user can con¬ 
struct a program with this instruction, and 
upload it to a network computer, or in¬ 
corporate it within an ActiveX applet. 
Once the program is run on the network, 
the network server crashes. The only pos¬ 
sible recovery comes by hitting the big 
red switch. Suppose you download an Ac¬ 
tiveX applet that contains this code. As 
soon as the code executes, your computer 
freezes up. 

Within one week of the discovery of 
the bug, Intel announced a software 
workaround that can be incorporated into 
virtually any operating system (except real¬ 
mode operating systems, like DOS). 

How the Bug Works 

When the processor encounters the in¬ 
struction F0 OF C7 C8 (or anything from 
FO OF C7 C8..CF), the FOOF bug occurs. 
The processor recognizes that an invalid 
opcode has occurred and tries to dispatch 
the #UD handler. Because of the LOCK 
prefix, the processor is confused. When 
the processor issues the bus reads to get 
the #UD handler vector address, the pro¬ 
cessor erroneously asserts the LOCK# sig¬ 
nal. The LOCK# signal can only be as¬ 
serted for read-modify-write instructions 
that modify memory. When the bus is 
locked, a locked memory read must be 
followed by a locked memory write, lest 
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(continued from page 62} 
unpredictable results may occur. But in 
this case, the LOCK# signal remains as¬ 
serted for the two consecutive memory 
reads required to retrieve the #UD vector 
address. The processor never issues any 
intervening locked write, and then hangs 
itself This behavior is shown in the logic 
analyzer trace in Example L As you can 
see, die Pentium tries to retrieve the #UD 


vector with two locked reads. After that, 
all processor activity stops. 

The Various Workarounds 

There are various workarounds to this 
bug — not all of them are good (a lew are 
outright kludgey, in fret). One workaround 
Intel has proposed actually takes advan¬ 
tage of die bug's behavior to do die right 
thing. Another Intel workaround is ingc- 


Sequeuee Address Data Mnemonic Timestamp 


T 524285 000000B2 -7E-— (-IO-WRITE-) - — ---.300 ns 

524286 00000018 -E14C ( LOCKED MEM READ ) 440 ns 

524287 0000001A F000 - ( LOCKED MEM READ ) 100 ns 


Example I: FOOF bug example. 
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nious, though a kludge. The first two al¬ 
ternate workarounds, to be presented 
shortly, arc given for academic purposes 
only. Even though the workarounds have 
demonstrated their ability to obscure the 
bug behavior, they are not entirely reliable. 

Infers First Workaround 

Pan I r IDT Alignment: 

A) Align the Interrupt Descriptor Table 
(IDT) such that it spans a 4-KB page 
boundary by placing the first entry start¬ 
ing 56 byres from the end of the first 4- 
KB page This places the first seven en¬ 
tries (0—6) on the first 4-KB page, and the 
remaining entries on the second page. 

B) The page containing the first seven en¬ 
tries of the IDT must not have a mapping 
in the OS page tables. T his will cause any 
of exceptions CHS it? generate a page not 
present fault. A page fault prevents the bus 
lock condition and gives the OS complete 
control to process these exceptions as ap¬ 
propriate. Note that exception 6 is the in¬ 
valid opcode exception, so with this 
scheme an OS has complete control or any 
program executing an invalid CMPXCHGSB 
instruction. 

Part II, Page Fault Handler Modifications: 

A) Recognize accesses to the first page 
of the IDT by testing the fault address in 
CR2. Page not present faults on other ad¬ 
dresses can be processed normally. 

B) For page not present faults on the 
First page of the IDT, the QS must rec¬ 
ognize and dispatch the exception which 
caused the page not present fault. Be¬ 
fore proceeding, test the fault address in 
CR2 to determine if it is in the address 
range corresponding to exceptions 0-6. 

C) Calculate which exception caused the 
page nest present fault from the fault ad¬ 
dress in CR2. 

D ) Depending on the operating system, cer¬ 
tain privilege level cheeks may be required, 
along with adjustments to the interrupt stack, 

EJJump to the normal handler for the ap¬ 
propriate exception. 

This is an ingenious solution to a horri¬ 
ble problem. Tin fortunately, the solution is 
as bad as the problem. When the proces¬ 
sor receives any of die first seven excep¬ 
tions (Divide by Zero through Invalid Op¬ 
code), the processor generates a page fault 
instead of the appropriate exception. The 
page-fault handler gets mucked up with all 
kinds of code to check privilege levels and 
whether the fault was caused by another 
exception. If'l had my druthers, Ed stay as 
far away from this solution as passible. 

Intel's Second Workaround 

Part l, IDT Page Access: 

A) Mark the page containing the first sev¬ 
en entries (0-6) of die IDT as read only by 
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setting bit 1 of the page tabic entry to zero. 
Also set CRO.WP (bit 16) to L Mow when 
the invalid opcode exception occurs on the 
locked CMPXCHG8B instruction, die pro¬ 
cessor will trigger a page bruit since it does 
not have write access to the page contain¬ 
ing entry 6 of the IDT. This page fault pre¬ 
vents the bus lock condition and gives the 
OS complete control to process the invalid 
operand exception as appropriate. Note 


When the processor 
encounters the 
instruction FO OF 
C7 C8, the 
bug occurs 


dial exception 6 is the invalid opcode ex¬ 
ception, so with this scheme an OS has 
complete control of any program execut¬ 
ing an invalid CMPXCHG8B instruction. 

R) Optional: If updates to entries 7-255 or 
rjie IDT occur during the course oF normal 
operation, page faults should be avoided 
on writes to these IDT entries. These page 
faults can be avoided by aligning the IDT 
across a 4-Kb page boundary such that the 
first seven entries (0-6) of the TDT are on 
the first read only page and the remaining 
entries are on a read/writeable page. 

Part II, Page Fault Handler Modifications: 

A) Modify the page fault handler to calcu¬ 
late which exception caused the page fault 
using the fault address in CR2. If the error 
code on the stack indicates the exception 
occurred from ring 0 and if the address cor¬ 
responds to Lhe invalid opcode exception, 
then pop the error code off the stack and 
jump to die invalid opcode exception han¬ 
dler. Otherwise continue w ith the normal 
page fault handler. 

Tilts workaround is really quite clever, 
in that it takes advantage of the bug as a 
means to provide a fix to the problem. 
When any of the first six exceptions oc¬ 
cur, they are handled as they normally 
would. Divide by Zero through BOUND 
exceptions vector to their normal excep¬ 
tion handlers without any intervening code 
in die page-fault handler. However, when 
die FQQF bug occurs, the page-fault han¬ 
dler is invoked instead of the #UD han¬ 
dier. Wily? CR0,WP=1 instructs the micro¬ 
processor to generate a page fault when 
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Sequence 

Address 

Data 

Mnemonic 

Timestamp 

428677 

00060FF3 

0008049A 

( LOCKED MEM READ ) 

330 

ns 


00060FFC 

00008E00 

( LOCKED MEM READ ) 



4ZS678 

000IE8B8 

EFF0FFFF 

( LOCKED MEM READ ) 

170 

ns 


0001E8BC 

00009B01 

( LOCKED MEM READ ) 



428679 

0001EE9C 

00000008 

( LOCKED MEM WRITE ) 

130 

ns 

428680 

0001EE98 

000003F2 

( MEM WRITE ) 

60 

ns 


Example 2: FOOF hug on noncacheable page . 


Sequence 

Address 

Data 

Mnemonic 

Timestamp 

429135 

00060FF8 

0008049A 

( LOCKED MEM READ ) 

330 

ns 


00060FFC 

0000SE00 

( LOCKED MEM READ ) 



429136 

0001E8B8 

BFF0FFFF 

{ LOCKED MEM READ ) 

170 

ns 


0001E8EG 

00009B01 

( LOCKED MEM READ ) 



429137 

0001EE9C 

00000008 

( LOCKED MEM WRITE ) 

140 

ns 

429138 

0001EE98 

000003F2 

( MEM WRITE ) 

50 

Tig 


Example 3: FOOF hug on page write-through. 


Sequence 

Address 

Data 

Mnemonic 

Timestamp 

426333 

00060FF8 

00080496 

( LOCKED MEM READ ) 

60 

ns 


00060FFC 

00008E00 

{ LOCKED MEM READ ) 



426334 

0001E8B8 

BFF0FFFF 

( LOCKED MEM READ ) 

170 

ns 


0001ESBC 

00009B01 

( LOCKED MEM READ ) 



426335 

0001EEA0 

00010002 

( LOCKED MEM WRITE 

120 

us 

426336 

0001EE9C 

00000008 

( MEM WRITE ) 

60 

ns 

426337 

0001EE98 

000003EE 

( MEM WRITE ) 

50 

ns 


Example 4: FOOF hug with cache disabled. 


tries 0-6) from the end of the first 4-KB 
page. This places the first seven entries 
(0-6) on the first 4-KB page, and the re¬ 
maining entries on the second page. 

Part II r Page Table Modification: 

A) Mark the first 4-KB page of the IDT as 
noncacheable by setting the appropriate 
Page Table Entry to Page Write-Through 
(PTE.PWT=1). This is by far the best so¬ 
lution. 

This solution maintains all of the ben¬ 
efits of having the page cacheable. How¬ 
ever, because the page is considered write- 
through, the processor is tricked into 
recovering from the bus LOCK up condi¬ 
tion. Example 3 shows the results of en¬ 
countering the FOOT bug when the page 
is cacheable, but marked as write-through. 

Alternate Solution #3 (For DOS Users) 

A) Turn off your cache, Enter your BIOS 
setup utility, and disable the processor 
cache. This prevents the FOOF bug from 
locking up your computer. 

This isn't really a viable solution for 
most people, Turning off the micropro¬ 
cessor cache can have a dramatically neg¬ 
ative performance impact on your com¬ 
puter. Example 4 shows the results of the 
FOOF bug with cache disabled. 


an attempt is made by the supervisor to 
modify a memory page. The processor 
doesn't actually attempt to modify ihe In¬ 
terrupt Descriptor Page (IDT page hold¬ 
ing the #UD vector address) when the 
FOOF opcode is encountered. But the hug 
actually makes the processor think iris 
mexiifying die IDT page with the #UD vec¬ 
tor. The locked memory cycle somehow 
convinces the internal state of the Pen¬ 
tium to Blink that a write cycle is going 
to occur. Since the transition to the #UD 
handler is considered a supervisor task, 
the processor thinks it s going to write to 
this page. Thus when CR0.WP=1, a page 
fauil occurs. 

Even though this is a clever fix f there 
are iwo things 1 don’t like about it: 

• The fix requires a kludge to the page- 
fault handler code. The page-fault han¬ 
dler must contain code to detect this con¬ 
dition, and vector to die #UD handler 

* Setting CR0.WP=1 isn't a viable solution 
for everybody. Some operating systems 
may not be able to use this workaround. 

If I were forced to choose between two 
of Intel’s “blessed" solutions, I'd choose 
this one. However, because Intel set a 
precedent in documenting a solution that 
actually takes advantage of the bug be¬ 
havior, this could give rise to much more 
elegant solutions that also take advantage 
of die bug behavior. 


In fact, there are several alternative so¬ 
lutions to the problem, some quite simple. 

Alternate Solution #1 

Part 1, IDT Alignment: 

A) Align the interrupt Descriptor Table 
(IDT) such that it spans a 4-KB page Ixxmd- 
ary by placing the first 66 bytes (IDT en¬ 
tries 0-6) from the end of die first 4-KB 
page. This places the first seven entries 
(0—6) on the first 4-KB page, and the re¬ 
maining entries on the second page. 

Part II. Page Table Modification: 

A) Mark the first 4-KB page of the IDT as 
noncacheable by setting the appropriate 
Page Table Entry to Page Cache Disable 
(PTE.FCD-1). 

All exceptions vector to their appropri¬ 
ate interrupt handler. The page fault han¬ 
dier doesn't need to lie* mucked up with 
any extra code. All of die exception han¬ 
dling code may remain unmodified. When 
the FOOF bug occurs, the processor issues 
the two consecutive locked reads. How¬ 
ever, die processor doesn’t lock up because 
the page is noncacheable. Example 2 
shows die logic analyzer trace of die micro- 
processor recovering from die FOOF bug. 

Alternate Solution #2 

Part 1, IDT Alignment: 

A) Align the interrupt Descriptor Table 
(IDT') such that it spans a 4-KB page bound¬ 
ary by placing the first 56 bytes ( IDT en¬ 


Condusion 

The ultimate solution to the FOOF bug 
problem is obtained when disabling the 
cache—indicating that some interaction 
exists between the cache and the bug. 
Instead of taking advantage of the cache 
interaction, Intel's second solution takes 
advantage of interaction between the bug 
and page-fault mechanism. Now that In- 
id has seL the precedent of using the bug 
behavior as a workaround, nobody 
should be concerned by the two more 
elegant solutions provided herein. My 
second alternate solution is by far the 
best. The exception handlers don't need 
to be mucked up with extra code, and 
the processor performance isn’t impact¬ 
ed in the slightest. Note, however, that 
neither of these two alternate work¬ 
arounds have been thoroughly tested in 
production code. 

I've written two programs so that you 
can examine this bug and the various 
workarounds: FQ0FBUG,EXE, which 
demonstrates all live workarounds for the 
bug T and F0DFBUG2.EXE, which demon¬ 
strates the most elegant workaround— 
Alternate Solution #2, The source code 
and executables for both programs are 
available electronically from both DDJ 
(see “Resource Center,” page 3) and at 
ftp://ftp.x86.org/doads/F00FBUG.2lP. 


DDJ 


66 


Dr Debt's Journal, May 1998 


















NuMeca 

DevPartner 


The StmrlDefeMgjjtnx" Companion for 


Microsoft Visual C++ 


PRODUCTIVITY 


VISUAL G 


WITHOUT 


rupTniBf- 


EXPANDING 


CpMPOWARE 


Compaware 


m T Compaware 4 

JMuMega 


Copyright o 19?8 Compuware Corporation All rights reserved, All trademarks and registered trademarks are the property cF their respective owners 


Finding good experienced developers to crank out defect-Free usable software is tough (not to mention expensive). 
But with NuMega DevPartner For Visual C++, you can turn up the productivity and capability oF your existing team. 
Its suite oF SmartDebugging tools For Microsoft Visual C++ accelerates development oF Visual C+* components and 
applications For the enterprise and the Internet. Using DevPartner for Visual C++, individual developers can 
deliver high quality well-tuned applications and components almost as Fast as whole teams can. That's because 
it automatically detects, diagnoses and Facilitates resolution of software errors and performance problems. Get 
^nlerwfse faster development and better code—all in one value-priced package—without adding 

Devli^raEnt^tner developers to your team. Order NuMega DevPartner For Visual C+ today. 


1-888-686-3403 www.numeqa.com 

SmartDebugging '' 1 For Faster development VISUAL C++"' VISUAL BASIC® JAVA '" 1 


30-0 AY MONEY BACK GUARANTEE 






We admit that on occasion we’ve even asked you 
to rip and replace what’s inside your head. 

But this is not one of those occasions. Instead 
we’ve got a plan to help you build the latest enterprise 
solutions using technologies you’re already familiar 
with. Microsoft’ Windows NT," COM, and the Visual 
Studio" development system can help you make 
it happen. It doesn’t even matter if you have UNIX, 
NetWare or a legacy system that needs to be 
integrated. Hey, this whole distributed computing 
thing gets quite a bit easier when you use what you 
have, and more importantly, use what you know. 
To find out more about building new solutions with 
Microsoft Windows NT and Microsoft Visual Studio 
go to www.microsoft.com/msdn 


« 1*08 MJcrPspft Corporation. M rigfitE reserved. MIcrDsult, Vlyygl Stmtflrr, Wbefff rt&ydu wml Kt toff®? tihd Window Ml are clt|w feggtertKI IfftdCTilBI** ul Qi IVN6HMWI1 Curpwitl*fl hr. the 

United Slates anrJ/nr other iMjunwies. OV^r product and company names mentioned heJfHi* flirty be Bte Uanfemrw&t of their respective owners. KartofT itocnsetl by Kartufl tindcfiwi&ea, Rittreeented by 
the Roger Rirhirtan Irir., Htwrty Hills. CA- t’Twftienat'aln ts a ftHfemafli ami Hjfjyrshil or Uruywsa* Clly srudkis, inc. Licensed by Universe Stuato* Licensing. Inc All rights Tescsrwd. 






Microsoft 

WHef« do you want to go today?' 



Microsoft* 




BackOffice” 

Microsoft® 


vv 


VISUAL 

Studio 

Visual Sasic 
Visual C++' 
Visual I r>terDev’ 
Visual J++" 
Visual FoxPro" 


I 














Extending 

Windows CE 2.0 MFC 
Database Classes 


Adding powerful 
search capabilities 

John C. Schettino, Jr. 

W indows CE programming is a lot 
like Win32 programming, since 
the CE API is a subset of Win32, 
w ith some CE-specific extensions. 
One of those extensions is an API for ob¬ 
ject databases. The object database API is 
reasonably rich, including indexed access 
to records in databases, with up to four 
indexes supported for each database. In¬ 
dexes are only on single integer or string 
properties, in ascending or descending or¬ 
der. There is a seek function in the API 
that uses the current index for rapid 
searches. Like its bigger siblings, Windows 
CE has a version of die Microsoft Foun¬ 
dation Class (MFC) library, which I con¬ 
sider the highway— or at least die paved 
road*—for developing CE applications. By 
comparison, using Win32/C is more like 
a gravel road. Bui because MFC in Ver- 


John is a developer of Windows CE f New¬ 
ton r and Macintosh software. He is also a 
Senior Member of the Technical Staff at 
GTE Laboratories Inc. He can he contact¬ 
ed at schettino@writeme.com or at http:// 
memhers.aol.com/pdcjohm/. 


sion 1,0 of the CE SDK did not provide 
classes for dealing with databases, I hit 
the gravel mad pretty often. 

Just when 1 was about to break down 
and write my own wrapper classes, ihe 
CE SDK Version 2.0 beta was released. 
Thankfully, the SDK 2.0 adds several class¬ 
es to MFC that wrap the database API and 



data structures into a more usable form. 
While this API is sufficient to accomplish 
most database tasks, it still lacks a general- 
purpose searching class. Consequently, in 
this article, 1 11 present a set of CE database 
classes and subclasses of the new 2.0 MFC 
classes that provide an object-oriented 
w rapper to the basic database search API. 
The complete wrapper is available elec¬ 
tronically; see “Resource Center," page 3- 


What's New in SDK 2.0 

The main database class is CCeDB- 
Daiahase. Objects of this class encapsu¬ 
late an open database, and support sev¬ 
eral operations. The operations I’m most 
interested in for searching are SeekFirstf), 
SeekFimEquaK), SeekNextO, SeekNext- 
EtfualO, SeekValueSmallerCX and Seek- 
VahteGreaUMJ. Tine SeekFirstEquaK) and 
last two seek methods take a CCeDBProp 
object as their only parameter, and use the 
current index to locate the first record 
equal to, smaller, or larger than the value 
encapsulated within. CCeDBProp is a 
wrapper class for both record properties 
and index properties. It contains a prop¬ 
erty identifier and value. The CCeDB- 
Record class encapsulates a record in the 
database, and consists of a set of 
CCeDBProp objects. It provides a couple 
of methods for extracting a property from 
the record. 

While there is a lot more to these class¬ 
es than Eve covered here, it’s about all 1 
need to extend the classes to add a more 
robust search capability. 

Objectives of the Seorch Classes 

What I d like to be able to do with MFC 
is encapsulate a query on an open 
database in an object, hand that object to 
a method of a database object, and get 
back an ‘‘iterater.” i could then retrieve 
matching records by simply calling the 
first/) and next/) methods of iterater. The 
query would support the intrinsic queries 
supported by the database, that is less 
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Application Development Tools 
"We Initially coded algorithms in 
Matlab and then converted the 
Matlab source to C or C++. To our 
surprise, the Matlab code was 
faster in nearly all cases/' 

Jack Slaub 
Hughes Aircraft 



Interactive GUI Design 

"In one day, I wrote S75 lines of 
Matlab which equates to 5,000 
lines of C code. I had a functioning 
GUI in one day. You can't do that 
with C " 

Kathleen Splaine 
Risk International 



Analysis and Visualization 

"Anything from simple analysis to 
complex modeling and simulation 
can he done in a fraction of the 


time It would take to write your 
own code/' 


Gregory l. Chamitoff, Ph.D. 


NASA, Johnson Space Center 


We wrote exactly 
698,794 lines of C 
code so that you 
don’t have to. 


More than 400,000 engineers and scientists use Matlab 
to accelerate their technical programming. Here’s why. 


Faster programming 

Today's most productive technical 
professionals have one thing in 
common - they use Matlab instead 
of C or C++. Because, unlike a 
general purpose language, 

Matlab is a complete, 
integrated analysis, visual¬ 
ization, modeling, and devel¬ 
opment environment specifically 
designed for technical computing. 
So development goes much faster 
and code is dramatically shorter. 

More numerical power built in 

At the heart of Matlab is an easy to 
leam technical computing language 
with more than 500 functions built 
in. The Matlab language includes 
flow control, multidimensional 


arrays, user-defined structures, 
ASCII and binary external data file 
access, and much more. And you’ll 
save even more time with Matlab 
Toolboxes, collections of highly 
optimized, discipline- 
specific functions written 
by world-class experts. 
You can even link in C, 
C++, and Fortran routines. 

Less time coding means more 
time to think 

Put simply use Matlab and it will 
take you far less time to develop 
finely tuned applications with 
revealing graphics, custom GUIs, 
and compact maintainable code. 
Now just imagine what you can do 
with the time you save. 


atlas ^ 

COMPILER 


See how Matlab 5 can help you work faster. Visit our 
Web site for demos, examples, and updating information. 
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(continued from page 70) 
tlian, greater than, or equal to tests. Un¬ 
like the basic index -seek operations, Ed 
like to be able to use more than one 
property in the query, as well as supply 
multiple matching tests For a property. 
Finally, Vd like to lift the restriction on 
using nonindexed properties in a query. 
To top it all off, Fd like the query to be 
as optimal as possible, including making 
full use of the correct index to retrieve 
all the matching records while accessing 
as few records as possible in the 
database, 

This sounds wonderful, but without a 
few simplifying assumptions I'd end up 
coding it for the rest of my life. In addition 
to the gcxxl things Fve decided on, 1 11 make 
one huge concession—FLL assume dial any 
query that contains multiple properties, or 
even multiple tests against a single prop¬ 
erty, is an implicit AND of all tests. Sure, l 
could support OH, but then I'd need a more 
complex representation of die query syn¬ 
tax as well as a query parser. Ill leave that 
as an exercise for the reader. Also not im¬ 
plemented but equally desirable are more 
general tests. It would be nice to lx* able 
to do regular expression matches or lots of 
other tests, and it's obvious exactly where 
those tests could be added, but I'm not do¬ 
ing them now. 

Class Design 

To implement extended searching, Ill first 
need to create the CEQuery class to en¬ 
capsulate a query, That will require sub¬ 
classing the CCeDBProp class so I can 
store test criteria with a property- Then 111 
need to extend the existing CCeDB- 
Database class to add the method to cre¬ 
ate an iterater used to search. Finally, I'll 
have Lo do the hard work of actually im¬ 
plementing the iterater class. 

The CEQuery and 
CCeDfiQueryProp Classes 

A query object consists of a CCeDB- 
Database object, a number of CCeDB- 
QueryProp objects representing the prop¬ 
erties to test, a test value (and the test 
operation), and a CCeDBQueryProp ob¬ 
ject representing the desired sort order 
(if any). Listing One (listings begin on 
page 110) presents the CEQuery class dec¬ 
laration. 

The addFfeldf) method adds a new 
CCeDBQueryProp object to the query. It s 
expected that CEQuery deletes this object 
in its destructor CCeDBQueryPrcp objects 
are stored in an array named _fields that 
is expanded as needed, so any number 
of tests can be performed in a query. 

The setSortFieldf) method takes a 
CCeDBQueryprop object (which is also 
deleted in the destructor) that specifies 
die desired sort order for die query- The 


numFiekisO method returns the current 
number of test fields in the query. 

The testOnO method is called during 
searching. It determines which CCeDB¬ 
QueryProp value (if any) to use to begin 
the query when querying the database. 
Each CCeDBQueryProp is tested against 
the supplied property ID, which is also 
saved in the Jndex member variable. If 
a matching CCeDBQueryProp uses an 
Equal Lest, then that object is selected to 
begin the query. If it matches and uses a 
Less Than (on a property with a de¬ 
scending index) or Greater Than (on a 
property with an ascending index) then 
it is used if none are found with an Equal 
test. At this time if a range search is de¬ 
tected (both a Greater Than and Less 
Than test on the indexed property, for 
example) then the jrange member vari¬ 
able is set to True- The goal of this 
method is to start the search off using a 
query value against the selected index 
that eliminates as many nonmatching 
records as passible. It also determines (by 
setting _range) if it is possible to stop tire 
query after a record fails a test on the in¬ 
dexed property. 

The match!) method is called during 
searciting. It tests every property in the 
supplied record against the CCeDB¬ 
QueryProp objects tn the query- If a range 
test is being done, the index property is 
tested first to see if it falls within the 
range. If it does not, then False is re¬ 
turned, but also an EOF flag is set indi¬ 
cating that the search is complete. The 
EOF flag is not set the first time the range 
test fails, to work around a bug in the 
MFC SeekValueSmaUeH) method. When 
used with a descending index, it actual¬ 
ly returns the record huger dian the sup¬ 
plied value, so the first Limc t the range 
test will fail. It’s not too expensive to do 
a couple of extra tests just to be sure, and 
it works around this bug just fine. The 
match() method calls matcbFielcK) to do 
the actual test for each property, Assum¬ 
ing the range test succeeds or is not per¬ 
formed, then each property in the record 
is tested against its corresponding CCeDB¬ 
QueryProp objects using match Field/ / 
When doing these tests the index field 
tests are skipped if range testing was pre¬ 
viously done. If all tests pass T matchf) 
returns True* 

The match Field/) method tests the 
supplied property from a potentially 
matching record against the specified 
test object, and returns True or False- It 
consists of a large case statement that 
does die appropriate comparison based 
on die data type and test operation for 
a property. 

The CCeDBQueryProp class is a sub¬ 
class of CCeDBProp (see Listing Two for 
its declaration). It adds an enumerated 


type that describes the three types of 
test possible on a property, as well as 
a member variable to hold the specific 
test for the property. To create a query, 

I allocate a new CCeDBQueryProp ob¬ 
ject for each desired test, set its prop¬ 
erty ID, type, value, and operation, and 
then add it to the query object using 
addFietdi"). 

CCeFindDBDatabase 

fd like to be able to hand my query ob¬ 
ject to a method of a CCeDBDatabase ol> 
ject to begin a query, That's accomplished 
by subclassing CCeDBDatabase and 
adding a Find/ ) method; see listing F fhree, 

The Find! ) method accepts a query ob¬ 
ject and allocates a new CCeFindDB- 
Databaselterater object. This object should 
be deleted when fm done with a query. 
To create a new iterater, I call the Find- 
Key() method to determine which key l 
should use, and then I pass that key and 
the CEQuery object to the CCeFindDB- 
Dat abase Iterater con s tructor. 

The TestFind/) method is useful to de¬ 
termine if a particular query will use an 
index or require a sequential search. It 
also calls FindKey() } and if no key is 
found it returns False- This might be handy 
if I want to warn the user that a query 
might take some time, since it will be a 
sequential search, 

The FtndKeyf ) method examines die 
CEQuery object against the current indexes 
defined for the database using the Isln- 
dexed() method. It determines which in¬ 
dex (if any) to use when querying the 
database. If the query includes a sort then 
that index must be used. If not, then the 
CCeDBQueryProp objects that use prop¬ 
erties that are indexed and weighted to 
determine die best index to use. If any of 
them are Equal tests, then that index is 
heavily weighted. If any are Less Than (on 
a property with a descending index) or 
Greater Than (on a property with an as¬ 
cending index) then they are weighted at 
half of properties using Equal. The high¬ 
est weighted index is then chosen. Tills 
ends up favoring the index that will re¬ 
quire the fewest number of tests to com¬ 
plete the query. 

The MndexedQ method compares the 
supplied property ED against all proper¬ 
ty IDs in the database dm have an index. 
If the property ID is indexed, it returns 
True, and also returns die sort order For 
the index. 

CCeFmcfDfiDofabase/ferater 

When 1 query a database, I want a sin¬ 
gle object to encapsulate everything hav¬ 
ing to do with the search in progress. 
The CCeFindDBDatabase!terater class 
holds everything needed to perform a 
search, including copies of the current 


72 


Dr. Dobbs Journal, May 1998 






SEE WHAT YOU 
HAVE BEEN MISSING 




End The Search! 

Looking for development tools to 
improve program performance, 
reliability, and memory usage? 
Look no further! Diab Data’s new 
RTA Suite represents a quantum 
leap forward in die quest for this 
etching new class of tools. 


A Visual Link Map .Analyzer 
for interactive ‘‘drag-and-drop’ 
optimization of memory maps. 




I For Every Developer 

The simplicity of die RTA 
Suite's graphical user interface 
makes tt useful for every 
developer on your team, Mow 
every developer can benefit 
from application specific 
optimizations and the ability 
to enhance code quality 
before Q.A, What's more, die 
RTA Suite is integrated with 
leading development 

environments such as ISI’s pRISM+™, SDS 1 
SlngleStep™, Enea OSE Systems' Illuminator™, 
and others, so you can access the RTA Suite 
from familiar tools. 


The RTA Suite 

Diab Data’s RTA (Run-Time 
Analysis) Suite allows you to 

capture and analyze your code's ■[ 
run time behavior, providing you §|§ 

widi the critical run-time insight 
you need to maximize program 
performance and reliability. Combined with 
Diab Data's industry-leading D-CC m and 
D'C++ m compiler suites, the RTA Suite 
enables you to develop faster, higher 
quality code in less time. 


With The RTA Suite 


Powerful Run-Time 
Analysis Tools 

A Visual Interactive Profiler 
for a revealing view of run¬ 
time program behavior, 
including hierarchical "top- 
down" analysis of parent/ 
children function pairs. 

A Run-Tune Error Checker 
for detecting memory leaks, 
hard-to-fmd pointer errors, 
stack overflows, and many 
other conditions associated with invalid memory references at run-time. 


The Expert's Choice 

Diab Data is the compiler 
leader for embedded 
PowerPC and Motorola 
H" r f RISC applications. So 

J when you are ready to take 
your PowerPC, 68K/ 

J CPU32, GoldFire, or 
M*CORL project to the 
next level, don’t fumble 
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(continued from page 72j 
query object and the database object. List¬ 
ing Four is its declaration. 

The constructor stores copies of the CE- 
Query (query) and CCeDBDatabase 
(database) objects in the _myQuery and 
_myDb member variables. Tt then opens 
the database object in jnyDB using tlie 
selected index. It sets _start to True, _mf 
to False, and zeros out the Jilts and Jests 
member variables. It sets die _primaty- 
Query member variable to the query prop¬ 
erty to begin the search by calling the 
testOnQ method of the CEQuery object. 
The iterator is now ready to retrieve match¬ 
ing records. 

The first() method returns the first 
matching record for the query. It can be 
used to restart a search in progress, or just 
to liegin a search. It just sets _slart to Tate, 
clears die search statistics variables, and 
calls the nextO method. 

The nextO method finds the first (if 
_start is True) or the next matching record 
in the database If _start is True, it begins 
Lhe query by calling SeekFirstf) if there 
is no appropriate lest on an indexed 
field. If there is an Equal test in _pri- 
maryQuery it calls SeekFirstEquaK ), if 
there is a Greater or Less 'Ilian test in 
_ primary Query it calls Seek Value- 


Greateti) or SeekValueSmallerO, If _stml 
is False, then it calls SeekNextEqual() if 
there is an Equal test in _primaryQuery, 
or SeekNextO otherwise. In other words, 
it tries to find the first (or next) value 
using the index and the test value sup¬ 
plied in die query object. If the seek op¬ 
eration found a record, it’s still not ready 
to return. Instead, the method enters a 
while loop calling match() with the 
record. If match( ) returns False but does 
not indicate that EOF has been reached, 
it gets the next record (again using 
SeekNextEquaU ) t or SeekNextf )) and tries 
again. This continues until a record 
matches or EOF is detected. It then re¬ 
turn the matching record or NULL if EOF 
was reached. 

The stats() method returns the num¬ 
ber of records tested and the number of 
matching records for the query. It can 
be called at any time, even during a 
query, to find out how many records 
have been examined and matched by the 
query. 

Example Application 

To test the classes, 1 wrote a simple MFC 
application that creates a database with 
1000 records containing random string, 
long, and date (FILETIME) properties and 


tl fa^mple Query App_E5 ^ 


Siring |gS w | ]Henry ||_E Z | |Lola 

Int [UT] [ 81 ® fGETJ pOOO 

Date [gFT] ff"/[T - /ri990” [iJT] ft - / \T ~/[2050 


Return Sort 


3 


Initialize 


(.s'eaffi'. a 


Figure 1: Query form. 


Results: 

Hpynxotoa 

4685 

Wednesday, February 07, 2046 


luquajtn 

8153 

Wednesday, February 02,1994 


Izpbopxe 

6255 

Saturday, August 17, 2024 


Jdhusu 

7065 

Wednesday, December I3 t 2023 


Jkswlpq 

6956 

Saturday, April 07, 2012 


Jxgxxn 

4137 

Thursday, September 23, 2004 


Jyuhxve 

5032 

Thursday, July 14, 1994 


Jzkooz 

2402 

Tuesday, November 17, 2043 


Keqvdo 

8098 

Tuesday, September 16, 2025 


Kzggvswxua 

4594 

Friday, December 12, 2003 


Lgtarizsq 

7428 

Friday, September I8 r 2026 


Lnrrxhmh 

7922 

Tuesday, October 21, 2031 

Stats: 

matches: 12 
misses: 158 
Total: 170 


Range search was used 




Figure 2: Output from a query. 


then allows searches on it. The suing prop¬ 
erty has an ascending index, and the long 
property has a descending index. The ap¬ 
plication has a simple form (see Figure 1) 
that accepts one or two strings, longs, and 
elates with search criteria* as well as a de¬ 
sired sort of none, string, or integer. 

Selecting Initialize creates a new test 
database with 1000 records. The initialize 
routine uses the CE2.0 database classes to 
create a database with indexes on a string 
and integer field, and populate it with 
records containing random data; see List¬ 
ing Five. 

The database is created and if the Cre¬ 
ate! ) method fails, the existing database 
Is deleted and then recreated. It’s created 
with the two indexes (on property 1— an 
ascending sort on strings, and property 
2— a descending son on longs) I use to 
test tlie query classes. Finally, 1000 records 
are added with random strings, longs, and 
dates that fall between 1/1/1900 and 
12/28/2100. 

To perform a search, users fill in the 
desired criteria arid select Search, The 
OnSearcb() method extracts the current 
criteria, builds a CEQuery object, opens 
the database using a CCeEindDBDatcibase 
object, and then uses the Find() method 
to gel an iterator for the results. Listing 
Six is the portion of the search routine 
that creates one of the query properties 
in the query object and then performs 
the query. 

For each query operation and test value 
in the form, it tests to see if the user en¬ 
tered a value. If the user did enter a value, 
ii extracts the desired test and value from 
the dialog and creates a CCeDBQueryPmp 
object, ft then adds this object to the CE- 
Qimy object using addFiddO. Once all tlie 
properties are added, ii opens the database 
and calls Find() to gel the iterator. A sim¬ 
ple for/next loop retrieves the matching 
records from the item ter. It uses the MFC 
TRACE() macro to display the properties 
of the matching records and search statis¬ 
tics in the debugger output window. 

To test the classes, 1 performed a se¬ 
ries of queries. The tests consist of do¬ 
ing single and then range searches on 
the string, long, and date fields, and then 
equality tests on each. Finally, a query 
using all three properties is done. For all 
these tests I tried selecting no sort order, 
a son order that did not correspond to 
the query properties, and finally one that 
did. l he results are encouraging. In cas¬ 
es where J did not select an index, but 
queried using the string (Greater Than, 
Equal tests) or integer (Less Than, Equal 
tests) properties the correct index was 
selected and the search completed with¬ 
out examining all records. In the case of 
range tests the fewest possible records 
w ere examined. Adding tests on the dale 
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property properly excluded non match¬ 
ing records. If I did a test using the long 
property but selected the string sort or¬ 
der, records were returned in the correct 
order hut an exhaustive search was (cor¬ 
rectly) performed. In other words, the 
classes meet all the desired objectives. 
Figure 2 is output from a query includ¬ 
ing range tests in all three properties. 

Conclusion 

The classes I’ve presented here add a pow¬ 
erful search capability to the basic database 
classes found in Windows CE 2,0. They’re 


The SDK 2.0 adds 
several classes to 
MFC that wrap the 
database API 


easy to use, provide fairly optimized 
searches f and allow for testing on every 
property in a database—not just strings 
and integers as is supported by the base 
class. The memory overhead of using die 
classes is reasonably low— a typical query 
object and iterator require only one to two 
KB of memory over the base classes. 

There are some obvious enhancements 
that are possible: I could implement a class 
that would allow for Boolean operations 
for combining query properties {AND, OR, 
NOT) rather than die implicit AND I use 
now. That would require a more complex 
matchQ method. I could add more test 
operations including substring and regu¬ 
lar expression matching. Ihat would ex¬ 
pand the mutchFwldO method. There Is 
one additional query optimization possi¬ 
ble as well: The case where the query 
specifies a Less Than (or Greater Than) 
test on an index that is ascending (or de¬ 
scending) could use the index, and ter¬ 
minate the query as soon as a key value 
fails. This is basically half the range test 
I've already implemented, and it would be 
easy to add to the testOnO and matcbO 
methods. Even without these enhance¬ 
ments, the query classes are a powerful 
addition to my programming toolchest. 


DDJ 

(listings begin on page 110.) 
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Dave hasn’t 
since 1QQ4, 



That’s a fact too. 


Four years ago, Dave Cawlfiefd at Dim Chemicals replaced 
expensive PLCs with OMNX Open Control Software and the 
QNX Realtime OS. "Since then f ” says Dave, “we've 
upgraded the control system regularly with new hardware 
and so ftware - including parts of the OS itself But not 
once have we had to reboot" 

For a handy 12-point checklist on OS reliability, 
download Dave's paper ; Which OS for PC-based Control?, 
at www.omnx.com/productinfo/technicai papers, htnr 




QNX* Microkernel Architecture 

The QNX OS on Dave's machine 
runs every OS component in its 
own MMU-protected address 
space. So if a driver - or 
virtually anything else - fails f 
the rest of the system stays up. 


- Deterministic realtime performance 

11.95 psec per context switch on a Pentium 133) 


Build Reliable Embedded Systems with QNX 

Most operating systems work fine - until they hit a minor software glitch. Or 
until you perform a simple maintenance task, like changing an input device. 
Then, like it or not, the OS goes down - and your application with it. 

With QNX, on the other hand, your system can recover from software 
faults, even in drivers and other critical programs. What’s more, you can hot- 
swap peripherals. Start and stop filesystems and network services. Change 


— Full MMU support for all processes 

— Small memory footprint 

• Fault-tolerant networking 

• Inherent distributed processing 
“ Internet and mobile SDKs 

• Embedded GUI 8 Browser 

— POSiX certified 

— Embedded OEM pricing 


I/O drivers. Add or remove network nodes. Even access the OS after a hard 
disk failure. All without a reboot. 

And here's the kicker: QNX scales seamlessly from handheld consumer 
appliances to continent-spanning telephony networks. So you can use one OS 
to keep all your solutions running 24 hours a day, 7 days a week - nonstop. 

And that’s a fact. 


www.qnx.com (don’t miss our demo!) 

call 800 676-0566 ext. 1364 



The Leading Realtime OS for PCs 


afrX Snriware Systems LUJ. Vdic-e: +t Stn-0&3i fpis (14 5&1-J57H Email mra&qm.eom 

gnv Turaiif wum- don j?:* .w + p fin run -final Fm Fimad- flNIhfimui^ftiini: 

C gin Imitoaaii Sulim III war dux * a ripilnil li iliiml f> linlL Simm* ItlUH LU M l»MH hllNIWM bW«l| •» wgirnii 


AD LINK 134 












EMBEDDED SYSTEMS 


Fast Memory 
Allocation 


Power-of-two for 
the PowerPC 403 


H. Thomas Richter 

■fe ower-of-two memory allocators waste 
mM a substantial amount of memory—- 
up to 50 percent in the worst case. 
In embedded- controller projects with 
execution-time constraints, however, it’s 
okay to sacrifice memory for speed The 
power- of- two Fast Memory Allocator 
(FMA) I present here is used in just such 
a project. 

T developed the FMA as part of a hard¬ 
ware/software project that controlled dig¬ 
ital and analog I/O lines between an IBM 
PowerPC 403-based coprocessor card and 
machines. (The PowerPC 403 is a family 
of 32-bit general-purpose embedded pro¬ 
cessors with data and instruction cache, 
MMU, DMA, fast interrupt latency, and 
low-power consumption; for details, see 
http7/www,chips, ibm.com/products/ 
embedded/,) We used the COP4G3 co¬ 
processor card to run OS Open, an IBM 
internal POSIX-compliant real-time oper- 


Thomas, an engineer for IBM Ent wick- 
lung GmbH, can be contacted at tichtem 
vnet. ihm.com. 


ating system. (Object code for OS Open 
is freely available at http://www,chips,ibm 
.cotn/prr ducts/ppc/dcicumems/.) Attached 
to line coprocessor card was an InterBus-S 
field bus to read and w rite digital and ana¬ 
log I/O lines. The coprocessor card com¬ 
municates via TCP/IP over an ISA bus with 
the Windows-NT hose Machine control 
programs were developed on Windows 
NT (using tools from a third party, which 



ported its tools to the coprocessor card 
and OS Open) and downloaded to the co¬ 
processor card for execution. We were 
able to run complete machines, such as 
the FESTO MPS, with this card and soft¬ 
ware, (The FF8TO MPS, manufactured by 
FESTO Inc,, is a machine that uses ana¬ 


log and digital I/O for manufacturing met¬ 
al parts.) 

The applications that we developed 
needed: 

* A fast, dynamic memory-allocation 
scheme to cope with burst requests and 
releases of equal-sized memory blocks. 

* Many processes to share C++ objects 
with virtual functions. (Classes con¬ 
taining virtual functions maintain a vir- 
tual function table that had to reside 
on the same address location for all 
processes,) 

To meet with these requirements, I 
tweaked the McKusic-Karels algorithm 
(also known as the “BSD 4.4 allocator 11 ) 
to optimize memory management for 
blocks fitting on a single page. (Requests 
spawning multiple pages are rounded 
up to the next page boundary. ! don’t 
currently support them, but support can 
easily be added using bitmaps.) Every 
memory request lower than or equal to 
the underlying operating system's page 
size is rounded up to the next integer 
y with y=2 x i >4, y<page size of OS. 
This policy is used, for example, in the 
buddy-storage allocator (described by 
DonaId Kn ulh in The Art of Computer 
Programming, Volume If UNIX BSD 4.4 
kernel heap system, and AtX operating 
system. 

The disadvantage of the BSD and AIX 
implementations, however, is that the 
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Are you an embedded programmer who likes everything at your 




With Phar Lap's TNT Embedded 
ToolSuite®, you get the convenience 
of a w inning combination for building 
embedded and realtime software. Use our 

RTOS with Microsoft® Visual C++® 5.0 
and the Developer Studio IDE and get 
a total embedded development tools 
solution!, Edit it Build it Assemble it 
link it. Cross-debug it Check on-line 
help. And do it all from within a single, 
familiar, time-tested IDE, 

Version 9,0 of the TNT Embedded ToolSuite lets you do what no other package does—build 
realtime software with Visual C++ and Developer Studies the most widely used compiler-lDE 
package available. Everything you need at your fingertips. Everything just mouse clicks away on 
the IDE toolbar 

The Only Hard Realtime Kernel that Works with Visual C++ 5,0 

Filar Lap's Embedded StudioExpress™ add-in is the bridge between our Realtime lTTS rM Kernel, 
Visual C++ and Developer Studio. It lets you use die integrated Developer Studio debugger 
as a cross-debugger with full hardware and kernel awareness, Its all just an icon dick 
away on the Developer Studio toolbar. 
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Realtime 

Embedded 

Development 


With support for 32-bit xBfi hardware, 
standard nehvorkrng/lnturnet protocols, 
and Win32® development took the 


technology to enhance your productivity. 

Visual C++ Comes Optimized 
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Ihe 32-bit x86 Experts 


Visual C++ has all the right stuff for 
embedded development: an inline 
assembler, I/O port access Junctions, 
structured exception handling and re-entrant RGMable runtime libraries. And its optimizing 
code generator is second to none when it comes to creating fast, tight code. 

Phar Laps support of Visual C++ and Developer Studio gives you a total embedded development 
tools solution that is Simply On Target™ 

Get Web-ready with the ToolSuite! See ourMicroWeb Server technology for embedded systems in 
action at the "World’s Smallest Web Server": [ 
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Page Table 



Figure I: Allocation of 32 bytes. 


pointer-base address of memory region 
page size 


Figure 2: Dividing by page size * 


Ox30000F E0- Ox 3QQQQQQQ 
4096 


Figure 3: Subtracting address from l?a$e address of the 
memory region. 


31 4 3 0 


Left-most 28 bits 

size 

of address 



Figure 4; Bit layout of page-table entry. 


(continued from page 78) 
buddy-storage allocator is time intensive, 
making it less suitable for real-time ap¬ 
plications. Also, the UNIX BSD 4.4 mem¬ 
ory allocator does not unite returned 
memory blocks, which leads to fragmen¬ 
tation. 

The FMA algorithm presented here 
overcomes these disadvantages. Unused 
and returned memory chunks are unit¬ 
ed when possible, thus avoiding frag¬ 
mentation. This is accomplished with 
very little overhead, so it is suitable for 
real-time applications. The complete 
source code for the FMA (for ADO Is avail¬ 
able electronically; see “Resource Center," 
page 3. 

The Algorithm 

The FMA and McKusic-Karel algorithms 
use several structures: 

* A continuous region of memory* The 
region is divided into pages and starts 
on a page boundary, which is four KB. 
Memory requests from applications, 
which must fit on a single page for the 
FMA to be handled, are rounded up 
to the lowest power-of-two number 
(16 bytes minimum). Thus, a four-KB 
page can be divided into chunks of 16, 
32, 64, 128, 256, 512, 1024, or 2048 
bytes without waste. Full pages are 
also possible. 

* A page table* It has as many entries as 
there are pages in the memory region. 

* The freelist. For each aUocatable size 
possible, there is an entry in die free¬ 
list* It chains together blocks of the 
same size. The freelist is an array of 
eight elements. 

* Pointers and counters for internal 
housekeeping* Last-used pointers in¬ 


dicate the last-used page in the region; 
anchor pointers indicate a released 
page in the region. And semaphores 
are employed to synchronize shared- 
access and maintain shared-memory 
descriptors* 

When an application starts, it creates a 
memory region via a call* The application 
specifies the size of the region (if it has 
to be created) and whether it is shamble 
or private* 

Memory is requested and released with 
subroutine calls identical to the malloc 
and free interface* The size of memory 
needed is specified in requests* and a 
pointer to a block of at least that size is 
returned. This address is used when die 
block is returned to the system. 

McKusic-Karels 

In die McKusic-Karels allocator algorithm, 
memory requests are satisfied from the 
corresponding freelist entry after die cor¬ 
rect size has been determined. Each en¬ 
try in the freelist is a pointer to a chain of 
free memory blocks of the same size. A 
null pointer indicates an empty list. If a 
request hits an empty list, a new page is 
obtained from the memory region* The 
page is divided into blocks that form a 
single linked list with the freelist entry as 
an anchor. The page-table entry for each 
page contains the size of the memory 
blocks the page was divided into* Figure 
1 illustrates the situation after the first 
memory request of 32 bytes* The start ad¬ 
dress of die block is returned to the ap¬ 
plication. 

When the block is later released, the 
base address of die memory region is sub¬ 
tracted From the current address, and the 
result Ls divided by the page size* The re¬ 


sult is die index to the page table that con¬ 
tains the size of die memory block when 
it was originally allocated; see Figure 2* 
Tile block is then inserted into the corre¬ 
sponding freelist. Since the page size is 
usually a power-of-two number, the divi¬ 
sion can lie replaced by a right-shift op¬ 
eration. 

Assume the memory region was allo¬ 
cated at address 0x30000000. With a 
page size of four KB, this first page is 
mapped at address 0x30000000, the sec¬ 
ond at address 0x30001000* and so on. 
Further assume that the application re¬ 
quested a 32- byte block and received a 
pointer to address Gx30000FEO. If this 
pointer is now returned to the memory 
system, the address is subtracted from 
the base address of the memory region; 
see Figure 3- The page-table entry for 
page zero lias a value of 32, so the block 
Ls inserted in the free list array at element 
number one. 

With many memory allocations and 
frees, each chain contains pieces from dif¬ 
ferent pages, Memory is never united to 
form bigger blocks* A burst of requests 
for 16-bytes memory blocks renders a large 
area of the memory region useless for big¬ 
ger requests even after all blocks have 
been returned. 

FMA 

The FMA stores additional information in 
the page-table entries and unallocated 
memory blocks; 

* Three bits are used to identify the 
block size into which a page has been 
divided. The bits represent the index 
into the freelist array and indirectly re¬ 
veal the block size of each element in 
the list* 
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• With the minimum block size set to 16 
bytes per request, all blocks are aligned 
on a 16-byte boundary (or multiples 
thereof). The last four bits of any block’s 
start address are zeros, and can be used 
for other purposes; see Figure 4, Each 
page-table entry is 32 bits wide, the 
same size for a pointer on a 32-bit pro¬ 
cessor, lire size field of a page-table en¬ 
try contains the index to the freelist ar¬ 
ray. With eight different sizes possible, 
four bits are sufficient. The address part 
contains the high-order 28 bits of the 
entry block start address (see next item). 
To form usable addresses, it must be 
multiplied by 16. 

* Returned memory blocks can be used 
to store additional internal informa¬ 
tion, When a page is divided into 
blocks of a particular size, the corre¬ 
sponding freelist is empty. The first 
block of a page, called the "entry" 
block, contains the number of re- 



Figure 5: Splitting page into 32-byte 
blocks. 



Figure 6: Splitting second page into 
32-byte blocks, 


maining blocks in the chain (includ¬ 
ing itself, a pointer to its successor, 
and two pointers for a double-linked 
list). The pointers for the double-linked 
list are used to chain entry blocks of 
the same size. Entry blocks are refer¬ 
enced from the freelist and from the 
page-table entry. 

Now assume the application requests 
a 32-byte block. The system allocates 
the first page and divides it into 128 
blocks. The first 127 blocks are chained 
together to form a single-linked list. The 
entry block contains the number of 
blocks on this chain. The address of the 
entry block is stored in the freelist array 
at element one. The high-order 28 bits 


block counfc 


page size 
block size 


Figure 7: Dividing page size by 
block size , 


of the entry block’s address are also 
stored in the page-table entry, as is the 
index into the freelist array. The last 
block on the page is returned to the ap¬ 
plication. Figure 5 shows the state of the 
system at this point. 

If another 3 2-byte block is requested, 
the system finds a valid pointer on the 
freelist array at element one, and returns 
the block following the entry block. The 
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block count in the entry block Is decre¬ 
mented to reflect this. 

If the block count drops to zero, the 
entry block itself is returned to the ap¬ 
plication. In this case, the freelist array 
element and the address part of the cor¬ 
responding page-table entry must be up¬ 
dated. They will both be set to zero. The 
index of the page-table entry is calculat¬ 
ed with the formula Figure 2. 

If the application requests further 32- 
byte blocks now, a new page is allocated 
and the same scenario repeats. Hie free¬ 
list entry points to an entry block; this time, 
however, it Is located on a different page 
(see Figure 6), 

Release of Memory 

If a tnemory block is released from the ap¬ 
plication, the page-table index is calculat¬ 
ed using die formula in Figure 2. The ad¬ 
dress part of the page-table entry can 
behave in two ways: 

• If it is not zero, it contains the high- 
order 28 address bits of an entry block 
for this page. The new block is ap¬ 
pended after the entry block and the 
block count is incremented. If all blocks 
of a page have been released, the page 
can be reused. 

* If it is zero, the first block of a page in 
use has been returned (this can be any 
block from that page), li will be the 
new entry block for that page. The 
page-table entry for this page refers to 
that entry block, and the block count is 
set to one. If no freelist entry for this 
block size exists, then the block is the 
only one available and the freelist en¬ 
try refers to it. 

If the freelist entry is already pointing 
to an entry block, the new entry block is 
the new head of the double- linked list. 

Assume the application returns its first 
requested block (which resides on the 
first page). The page-table entry for this 
page indicates a block size of 32 bytes 
with zero for the address part, making 
this block the entry block. The freelist 
entry for 32-byte blocks is valid, so the 
entry blocks must be chained into a 
double-linked list, as in Figure 8, If an¬ 
other block from the firs? page is released, 
the address part of the page-table entry 
points to a valid entry block, and the 
newly released block is simply append¬ 
ed. The block count in the entry block is 
incremented. 

Reuse of Pages 

If all the blocks residing on a single page 
have been returned, the page can be 
reused (there is no need to unite adjacent 
pages since memory requests spanning 
multiple pages are not currently support¬ 


ed). All blocks have been returned when 
the block count in the entry block is the 
same as the page size divided by the block 
size; .see Figure 7. 

Since page size and block size can be 
expressed as power-of-two integers, the 
division can be replaced by subtraction. 
If the page is not used anymore: 

* The entry block must be removed from 
the double-linked list of entry blocks, if 


The FMA allocation 
stores additional 
information in the 
page-table entries 


die freelist dement refers to diis entry 
block, it is updated to point to a suc¬ 
cessor (or Is set to null if no more en¬ 
try blocks are on the list). 

• The corresponding page-table entry 
is no longer referred to and can be 
reused to form a single-linked list of 
free pages. 

The system maintains an anchor to the 
first free element in the page-table entry 
(named anchor ), and a pointer to the last 
page allocated From die memory region 
(called last used ); see Figure 7. las fused 
points to the Iasi page allocated in die re¬ 
gion, If a new page is requested and an¬ 
chor is null, las fused is incremented to the 
next page boundary until the end of the 
region is reached. If anchor is not null, it 
points to tlie page- table entry of a reusable 
page. The page-table entry contains a suc¬ 
cessor (if any), or -1 (which terminates 
the linked list), 

For example, assume that the appli¬ 
cation has issued more memory requests 
and several pages have been allocated. 
Now assume memory is released so that 
pages numbered zero, four, and two can 
be reused (in that order). First, the cur¬ 
rent value of the anchor is written into 
page-table entry zero, and the anchor 
points to page-table entry zero. Then 
page four is released. This is the last 
page of the region, pointed to by las- 
tused , so last used is simply decrement¬ 


ed by page size. Page two is released, 
and this page-table entry is the new head 
of the linked list of free pages; see Fig¬ 
ures 9 and 10. 

If die number of used pages in the re¬ 
gion drops to zero, all pages have been 
released, lastused points to the beginning 
of the region, and the anchor is set to 
zero. In other words, the region is reini¬ 
tialized. 

Full Page Requests 

If a memory request is rounded up to a 
full page, a free page is obtained from 
the region as previously described. The 
size field of the page-table entry is set 
to eight, indicating that the page hasn't 
been divided into blocks. If the page is 
later released, its size indicates that it can 
be returned to the memory region im¬ 
mediately. 

Overhead 

The space overhead is very low. The 
page table consists of four bytes per en¬ 
try. For a 256-MR region, 65,536 page- 
table entries are needed, assuming the 
page size is four KB. This equals 256 
KB or 0.00098 percent. The size for the 
freelist and the other internal counters 
and pointers can be neglected. This 
amounts to less overhead than you get 
with heaps maintained by traditional 
linked lists. 

Performance and Memory Utilization 

I conducted performance tests for the BSD 
4.4 algorithm, the FMA, and the AIX 4,1.4 
system heap. The sequence of memory re¬ 
quests and releases and the sizes allocat¬ 
ed per request differ from application to 
application. There is no paltem that is valid 
for all possible scenarios. The test program 
(available electronically), therefore, ran¬ 
domly requests and releases previously al¬ 
located blocks. The requested size is also 
determined randomly, within the follow¬ 
ing limits: 

• A block is never larger than the page size. 

• Block requests of a particular size may 
not exceed the boundaries shown in 
Table 1. 


Block size 
in Bytes 

Percentage 

16 

10 

32 

30 

64 

30 

128 

15 

256 

6 

512 

5 

1024 

2 

2049 

1 

4096 

1 


Table 1: Percentage of memory block 
sizes , 
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(continued from page 82) 

Ten percent of all calls request a block 
size of l6 bytes, 30 percent request a 
block size of 32 bytes, and so on. The 
test program keeps track of up to 5000 
allocated blocks. If the table overflows in¬ 
stead of randomly releasing blocks (which 
can happen due to the way blocks are 
selected for release), all allocated blocks 
are freed before the next block is allo¬ 
cated. This reflects the behavior of many 
applications that allocate memory blocks 
to build tree or list structures and never 
release them, They rely on die operating 
system to release die requested memory 
when the program terminates. I ran the 
test program on an IBM Thinkpad 850 
with a PowerPC 603e processor running 
at 166 MHz and 96 MB of memory. The 
operating system AIX 4.1,4 was in single- 
user mode. 

Table 2 lists the difference in run time 
when a heap-memory management sys¬ 
tem is active. First, a sequence of mem¬ 
ory requests (including size) and releas¬ 
es is calculated Tills sequence is executed 
twice—first, without actually issuing 
memory-heap calls; and second, execut¬ 
ing memory-heap calls. Table 2 shows 
the time difference (in milliseconds) be¬ 
tween both executions. Table over How 
indicates how often the application has 
freed all allocated blocks before request¬ 
ing a new one. 

As Table 3 shows, the FMA achieves 
far better memory utilization than the 
BSD 4,4 allocator. The FMA saves about 
49.5 percent of memory when allocating 
up to 60,000 memory blocks. The per¬ 
centage of memory savings decreases as 
the number of blocks allocated increas¬ 


es, Tliis is due to the fact that die test 
program only keeps track of 5000 allo¬ 
cated blocks. 

Interfaces 

I haven't altered die traditional C interfaces 
malloc and free, or the C++ interfaces new 
and delete, for dynamic memory allocation. 


The FMA achieves 
far better memory 
utilization than the 
BSD 4.4 allocator 


Applications can be developed and tested 
with standard tools and debuggers, and 
some vendors offer special l leap debug¬ 
gers. When finished, the operators for new 
and delete or the functions malloc Lind free 
are simply replaced. 

The C interface consists of the follow¬ 
ing function calls: 

* int f mainitisizeJ, ini char); creates 
and initializes the region, The first pa¬ 
rameter specifies the size in bytes. The 
second parameter determines the char¬ 


acteristics of the heap. The region can 
be allocated in shared memory or in 
the process’s private data segment. 
Shared memory is mandatory if the 
heap is to be shared between pro¬ 
cesses, Information on whether access 
is private or shared is located in a 
shared-memory segment . If access is 
shared, memory requests are serial¬ 
ized (to protect internal data) using 
semaphores. The region can be pinned 
in memory. However, certain restric¬ 
tions apply, such as permissions (root 
user) and real memory restraints. The 
third parameter indicates that the re¬ 
gion has to be created (if it doesn’t al¬ 
ready exist). 

• intfmadownO; removes access to the 
heap. No further requests can be made 
after this call. Pointers into the memo¬ 
ry region may be invalid and could 
cause core dumps. If the last process 
sharing the heap calls fmadown , the 
shared segment is removed from the 
system. 

• Void *fmaal!oc(u nsigned long); requests 
a block of memory of a particular size. 
The address of the block is returned (or 
null is returned if no more memory is 
available). 

• void fmafreef mid *); returns a block. 
The parameter is the address of the 
block as returned from fmaalloc. 

C++ interfaces use a structure that 
must be defined before the first call to 
new ; see Listing One (listings begin on 
page 87). 

The first cal! to new creates the mem¬ 
ory segment according to the values de¬ 
fined in the structure. The members size. 


Page Table 



Figure 8: First block requested is returned , 
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Figure J): Page-table entries after more requests, 
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Figure 10: Page-table entries 2 and 0 form linked list , 
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(continued from page 84) 
flags , and key have exactly the same mean¬ 
ing as in the fmaintt call. 

While the first call to new creates a 
memory region (or attaches to an exist¬ 
ing one), the last delete should delete or 
detach from it. Since automatic detec¬ 
tion of the last delete is not possible, a 
little help from the application is need¬ 


ed. When the member removejnow is 
set to True, the next delete call will de¬ 
tach from the region and destroy it if the 
application was the last one using it; see 
Listing Two, 

If the program Is compiled with die “de¬ 
fine FMA" set, it will use the FMA, and must 
be linked with the FMA version of opera¬ 
tors new and delete. Otherwise, it will use 


the standard C++ operators supplied by the 
vendor's library. 

Conclusion 

The FMA is about 40 percent slower (on 
average) than the BSD 4.4 allocator, but 
offers the advantage of uniting released 
memory blocks and reusing pages, re¬ 
sulting in memory savings of nearly 50 
percent. The AIX 4.1 A heap allocator is 
listed as reference only. It is a general- 
purpose allocator that sacrifices speed 
for space optimization. Real-time ap¬ 
plications need deterministic behavior, 
so the upperhound for memory alloca¬ 
tion (FMA:4300ns, BSD4.4:34G0ns, as 
shown in Table 3) is of particular in¬ 
terest. 

As it turned out, in my example, there 
was no need to implement support for 
memory requests spawning several 
pages. 
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Blocks 

Requested 

FMA 

Run-Time Difference 

BSD 4.4 AIX Heap 

Table 

Overflow 

5000 

12ms (2400) 

Sms (1600) 

82ms (16400) 

0 

10000 

32ms (3200) 

20ms (2000) 

206ms (20600) 

0 

15000 

62ms (4133) 

48ms (3200) 

366ms (24400) 

1 

20000 

74ms (3700) 

58ms (2900) 

502ms (25100) 

1 

25000 

98ms (3920) 

72ms (2880) 

578ms (23129) 

1 

30000 

126ms (4200) 

100ms (3333) 

706ms (23533) 

1 

40000 

152ms (3800) 

134ms (3350) 

938ms (23450) 

2 

50000 

192ms (3840) 

160ms (3200) 

1198ms (23960) 

3 

60000 

254ms (4233) 

190ms (3166) 

1446ms (24100) 

3 

70000 

268ms (3829) 

200ms (2857) 

1604ms (22914) 

4 

80000 

328ms (4100) 

240ms (3000) 

1810ms (22625) 

5 

90000 

348ms (3866) 

278ms (3089) 

2102ms (23355) 

5 

100000 

376ms (3760) 

320ms (3200) 

2240ms (22400) 

6 

150000 

644ms (4293) 

472ms (3146) 

3356ms (22373) 

9 

200000 

784ms (3920) 

642ms (3210) 

4512ms (22350) 

12 

Average 

per Block 

3812 (139.33%) 

2736(100%) 

22712 (830.12%) 



Table 2: Average run time depending on number of blocks requested. 
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IE 


Number of 

Blocks 

Number of Used Pages 

FMA BSD 4.4 

Savings (in %) 

5000 

41 

79 

48,01 

10000 

82 

163 

49.69 

15000 

120 

248 

51.70 

20000 

153 

295 

48.14 

25000 

190 

354 

46.33 

30000 

207 

404 

48,77 

40000 

264 

499 

47.10 

50000 

314 

584 

46.24 

60000 

281 

696 

59.63 

70000 

444 

790 

43.80 

80000 

490 

679 

44.26 

90000 

519 

916 

43.35 

100000 

557 

993 

43.91 

150000 

758 

1255 

39.61 

200000 

935 

1461 

36,01 


Table JL* Memory utilization in page blocks. 
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EMBEDDED SYSTEMS 


Listing One 

struct t_h«appoliey t 
unsigned long aise; 
int flags; 
char hey; 
int renove_iMW.: 


Listing Two 

int main (int *uf£¥ t char **argvO 

£ 

ilfdaf FMA 

t.hflappoliay policy 1); 

i/endif 

int *ip - new int; // Create a private heap of 1 MB 
lifdef FMA 

policy,rHJ»ve_nov = 1+ 

tlendif 

delate ip; 
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INTERNET PROGRAMMING 


Active Data Objects 

& ASP 


Executing scripts 
on a server 

Mark Betz 

T here are times in web development 
when you need direct access to data 
from the web server Perhaps your 
application is a simple catalog or re¬ 
port generator. But even a complicated 
extranet app has its two-tier elements. 
One clear example is the need to man¬ 
age application state. Not long ago, just 
about any web app that did more than 
display pages had to manage its state on 
the server side. This is still the case if your 
app needs to be compatible w ith abso¬ 
lutely every browser ever made* If you Ye 
willing to thin the herd and select a sub¬ 
set of clients* then you can move some 
processing, and thus some state man¬ 
agement, out to the client in the form of 
scripts, controls, and applets. Still, it is dif¬ 
ficult to carry state across pages on the 
client. If for example, you have an HTML 
element that may be in one of two states, 
then its state is probably going to be set 
on the server eveiy time you build a page. 
Otherwise, you would somehow have to 
persist the state on tire client across re* 
quests. This is certainly possible, but the 
available solutions may involve things like 
writing to the local registry or file system, 
which makes users nervous* 

It’s easy, however, to carry state across 
pages on the web server. Early applica¬ 
tions lor the Web used CGI programs that 


Mark is president of Incurrent Chip,, a spe¬ 
cialized commerce service provider and de¬ 
veloper of internet cardholder sen ice appli¬ 
cations for the payment card industry. He 
can he contacted al mbetzfflineu mmt, com 
or httpj/www. inctmmLcom/, 


sometimes wrote to local files or more- 
commonly accessed data on a server 
somewhere. The stored data had to be 
keyed to a particular user (not a trivial 
problem in HTTP), but otherwise, there 
was no limit to what could be persisted 
to die DBMS. CGI programs are proba¬ 
bly still Lhc most common means of ac¬ 
tivating web applications. However, re¬ 
cent technologies focus on extending the 
web server through the use of compo¬ 
nentry in the form of COM, COR BA. and 
Java objects, and, very recently, on Line 



use of server-side scripting environments. 
Server script is particularly suited to ce¬ 
menting the incoming HTTP request to 
the layer of object interfaces that imple¬ 
ments transactional capability* It is also 
very useful for managing application state 
and formatting pages* Page generation is 
handled more readily in script than in 
CGIs (Perl aficionados will complain, 
quite rightly, but Javascript and VBScript 
are certainly more approachable for many 
developers than Peri). Furthermore, the 
pages in a web application are subject to 
frequent change, and scripts provide a 
very malleable interface between the 
pages and die less-mutable parts of Lhe 
system. 


Microsoft's Active Server Pages 

Microsoft’s Active Server Pages (ASP) pro¬ 
vides an environment within which a script 
executes on the web seiver, resulting in 
output that is fed back to die client. ASP 
is compatible with various scripting lan¬ 
guages through die use of COM interfaces 
that expose the scripting engine. Currently, 
the environment supports VBScript and 
Javascript. Active Data Objects (ADO) Ls 
another Microsoft technology, one built 
on top of OLEDB, which is in turn built 
On top of the Open Database Connectiv¬ 
ity (ODBC) Standard. ODBC provides a 
functional interface dial separates data ac¬ 
cess from data providers, and allows ap¬ 
plications to execute SQL queries against 
different server implementations. OLEDB 
provided die COM interface to ODBC, and 
ADO simplified OLEDB wtith an automa¬ 
tion wrapper that was useful to Visual Ba¬ 
sic programmers. With die growing pop¬ 
ularity of ASP as an extension toed for 
Internet Information Server (US), ADO has 
emerged as a fast, reliable way of con¬ 
necting your server script to data. Keying 
application state to a specific user is a 
good example of using ADO wadi ASP. 

Identifying a user so you can grab the 
right data is a problem, because HTTP is 
a connectionless protocol. It is imple¬ 
mented as a set of atomic request/re- 
sponse transactions. The protocol does 
not provide any concept or a client across 
requests. In the context of static pages, 
this is a reasonable choice: It makes the 
HTT P protocol much simpler, and thus 
much more robust. However, the lack of 
connection .state has a profound impact 
on the design of information- or transac¬ 
tion-driven web applications. Consider the 
problem of authentication across two re¬ 
quests. On request one, the server can re¬ 
ceive the user authentication tokens as ar¬ 
guments in a form POST. The tokens can 
be validated against a user database and, 
if the user authenticates, the next page in 
the app can be returned. Request two, 
originating from the returned page, must 
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somehow convey to the server the au¬ 
thenticated state, as well as die identity, 
of die user. When the server responds to 
request two, it will use tills information to 
determine the current state of die appli¬ 
cation. 

There are basically two ways to get user 
information across the connection: 

* Piggyback it onto the HTTP transaction, 
either as parameters to a GET, or name- 
value pairs in a POST. 

* Use cookies. Cookies are short strings 
of text that are generated on the server 
and sent to the client, and are then re¬ 
turned to the server by the client when 
a request is made, 

ASP provides an automation object 
named Session that utilizes cookies to main¬ 
tain connection state. Hie Session object is 
very easy to use. However, it suffers from 
problems with cookies, namely diat users 
perceive diem as a privacy dir eat and can 
disable diem. Cookies were die only means 
Microsoft had for enabling user sessions 
without intruding into die application script. 
The first method, adding user information 
to GETs and POSTs, is more intrusive, but 
more reliable. Since the script-level re¬ 
quirements can be minimized through the 
use of server includes-— blocks of script 
included into a page prior to script exe¬ 
cution— or by encapsulating the needed 
functionality in COM controls, developers 
often prefer this method. It also makes a 
more interesting example of ADO use. 

In this example, there is a set of pages 
that cannot be retrieved by users unless 
they have authenticated, Once authenti¬ 
cated, a user will lie assigned a session 
key, and the key will be used to access 
application state on die web server. The 
session key will be embedded into re¬ 
turned pages as a parameter to any URLs, 
and as a hidden input field in any forms. 
On the server side, the script has die re¬ 
sponsibility of making sure that a con¬ 
trolled page is accessed with a valid ses¬ 
sion key. If a request is received with no 
session key, or an invalid session key, then 
the user must be redirected to the login 
page. What constitutes an invalid session 
key? The system may have been forced to 
invalidate the key due to an error A more 
common reason is simply that die user is 
finished, and no longer needs the key. 
Unfortunately, there is no easy way to 
know when this is the case. HTTP pro¬ 
vides no hint of state across requests, so 
how do we know which request was die 
last one? A logoff button can be provid¬ 
ed, but users can easily surf out without 
clicking it. The only reliable mechanism 
involves a process that times out inactive 
sessions. We won’t get into that aspect of 
the system very deeply here. 


Our database requirements are simple. 
Figure 1 shows two tables. The first is 
named tbl_User. This table has four 
columns. UserlD is an IDENTITY column 
that serves as die primary key. UserName, 
UserPwd, and UserFullName are all char 
columns with self-evident uses. The sec¬ 
ond table is tbl_Session. It has three 
columns. SessionlD is anodier IDENTITY 
column providing die primary key. It isn’L 
necessarily a good idea to use an IDEN¬ 
TITY column to generate session keys. A 
better mechanism is to generate a pseudo¬ 
random 32-bit integer, then hash it against 
the client’s IP address. For this example, 
well keep it simple. The other two 
columns of tbljsession are UserlD , a for¬ 
eign key into tbiJUser, and LastAccessTime, 
which is a smaUdatetime used to control 
inactive sessions. These two tables will 
provide all the storage we need for the 
purposes of authenticating and providing 
session state. In practice you cl have oth¬ 
er tables keying on the session or user 
IDs, and containing various parts of die 
application's state. 

Listing One (listings begin on page 111) 
provides examples of the first two compo¬ 
nents in our app—the login form and die 
login script. When the user clicks die login 
button in usedogin.htm, a POST is execut¬ 
ed against die web server, containing die 
form input field contents, and identifying 
the login.asp script as die target. On entry 
to the login, asp script, we first need to ex¬ 
tract the login parameters from the HTTP 
request. ASP provides an automation ob¬ 
ject called Request . Among die Request ob¬ 
ject's several collections is one called Form. 
The values in the Form collection corre¬ 
spond to die named input fields in die lo¬ 
gin form. Thus, the script <%UserName - 
ReqimtFormniserNameO%> retrieves the 
value that the user typed into the “User- 


Name" edit. The first duty of login,asp is 
to extract the user name and password 
from the Form collection, and get them 
into variables. If this operation fails for 
any reason, the user must be redirected 
back to useriogin.htm. Checking parame¬ 
ters before processing diem is always a 
good idea: The “calls" into your script are 
often URLs that hang out in the browser 
address bar, tempting users to play around 
with them, It’s a good idea to program 
defensively. 

Once die parameters are verified, we need 
to create the database connection. ADO’s 
Connection object represents a connection 
to an ODBC datasource, Opening the 
database connection requires three steps. 

1. First, the connection string is created. 
It contains die ODBC datasource name, 
the name of an authorized user, and the 
user’s password. 

2. Next, the connection object is created. 
There are two primary means of creat¬ 
ing objects in ASP, Eidler die object is 
inserted into the page using the HTML 
OBJECT tag and the RLINATSERVER" 
parameter, or die Server object is used 
as a factory. The Server object is one 
of the built-in ASP objects, and its Cre- 
ateObject method can create an instance 
of any scriptable automation object. It's 
a bit cleaner than the OBJECT tag, so 
it is the path we’ll take. The only pa¬ 
rameter we need is a string identifying 
the class and interface we want—in 
this case, the ADODR class and the 
Connection interface. The result of die 
call is assigned to the Conn variable. 

3- Finally, the Connection object’s Open 
method is called and passed the con¬ 
nection string we built previously. If all 
Is successful, when this methexi re tunas, 
the Conn variable holds an open 



Figure 1: Sample database tables. 
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connection on our datasource, L Should 
an error occur anywhere in this process, 
the ASP engine wiH cease ev aluating the 
script and return an error to the client. 

With the connection open, the first or¬ 
der of business is to authenticate the user 
by looking up the user name and pass¬ 
word in the database. A simple SQL SE¬ 
LECT statement suffices in this example. 
If no record containing both the name and 
password is found, then the authentica¬ 
tion fails and the user would normally 
bounce out to an error page, in this case, 
we'll just return them to the login screen. 
To execute the SQL statement, we first 
build it into a string. This process is fa¬ 
miliar to anyone who has programmed 
SQL Server using Microsoft's APIs. Once 
the query 7 string is constructed, it is passed 
to the Execute method of the Connection 
object, The Execute method is capable of 
running any SQL statement that is sup¬ 
ported by the data provider. Since ours is 
SQL Server, we’ve got a wide range of 
functionality available. Later, we ll see an 
example of compound statements, multi¬ 
ple result sets, inserts, and so forth If 
you’re keeping an eye on portability, you 
don't want to get too fancy, and in any 
case, it is evident from reading the listings 
why restricting the use of ADO and ASP 
to managing application state and other 
local web server tasks is a good idea, ADO 
enshrines the data schema in die appli¬ 
cation source code. 

The result of a successful call to Exe¬ 
cute is an interface on a RecordSet. Hie 
RecordSet object is ADO’S primary inter¬ 
face to data on a server, with a number 
of properties and methods that provide 
full cursor access to result sets. Note that 
not all operations are available from ev¬ 
ery data provider Often, if an operation 
is not supported, the result is a NULL, or 
other ambiguous value. In the case of the 
AbsolutePosition property, for example, a 
return value equal to die constant adFos- 
Unknoum may mean the RecordSet is emp¬ 
ty, the current position really is unknown, 
or die provider does not support the ab¬ 
solute position property. The moral is: 
Know your datasource. If you’re keeping 
things relatively simple, you won't have 
any problems. At the moment, our needs 
are very simple. If Conn-Execute com¬ 
pletes successfully, then die RecordSet is 
assigned to die variable RS, and the script 
in login,asp checks to see if the RS.EOF 
property is True. EOF is set to True when 
the current position is one after the last 
record returned, and so is True for an 
empty result set. True, in diis case, means 
the user record wasn’t found, so we send 
the user out to userIogin.htm again. If 
RS.EOF is False, then we retrieve the se¬ 
lected UserlD into a variable by query ing 


the RecordSet Field collection. Fields are 
available for die current row r and may be 
accessed by name or ordinal position. In 
diis case, we ask for UserlD by name. 

With the user authenticated and the 
database TD in hand, the script has one 
more check to make before creating the 
session. Since this is a web app t die user 
may be trying to log on again while a ses¬ 
sion is still active for him, To prevent this, 
the script queries the session table to see 
if the UserlD is present. If it is, then the 
selected session key can be reused in die 
redirect to the welcome page. Assuming 
the user does not have an active session, 
die script must create one by inserting a 
new record into the session table. We not¬ 
ed earlier that SQL Server was going to 
generate die session keys for us through 
die use of an IDENTITY column in tbi_Ses- 
sion , An IDENTITY column is automati¬ 
cally stuffed with a unique value when an 
INSERT takes place on die table. Since we 
need the session key, we also need to get 
our hands on the value of that column 
right after the INSERT. Fortunately, SQL 
Server supplies the ©©IDENTITY global 
that contains the last generated value for 
an IDENTITY column. Unfortunately, 
©©IDENTITY is only valid until the next 
INSERT against a column without an 
IDENTITY attribute occurs. We don’t know 
when that will happen. Therefore, we’ll 
use a compound statement to insert the 
session record and grab the IDENTITY 
column value all at once. 

The use of a compound statement 
gives us a chance to look at how the 
RecordSet object handles multiple result 
sets. Tile statement itself is simply a batch 
of commands separated by semicolons. 
The first command in our batch is an IN¬ 
SERT. What we’re inserting is the UserlD 
and the initial value for Last Access Time. 
SQL Server generates the SessionTD, up¬ 
dating ©©IDENTITY to tile new value, 
which is selected out by the second com¬ 
mand of the batch. Assuming Conn .Exe¬ 
cute completes successfully, RS will con¬ 
tain an empty RecordSet resulting from die 
successful INSERT. Since we executed a 
batch, we know that there is another 
RecordSet, To get to it, we call NextRe- 
cordSet on our existing RecordSet. What 
we get back, hopefully, is a new Record- 
Set that is positioned at the Erst row in the 
results from the next command in the 
batch. At this point, all that remains Ls to 
extract the SessionID by querying for RS(0), 
We use an ordinal here because our SE¬ 
LECT did not assign a column name to 
the result. RS(0) accesses the first field of 
die current row, which is our selected 
identity value. The last step in die process 
simply redirects die browser to die wel¬ 
come page with a URL that includes the 
session key. 


The welcome page makes its way into 
our example because it demonstrates the 
basic tasks that all session- controlled pages 
must accomplish when a user requests 
them: checking for a valid session, up¬ 
dating the session’s LastAccessTime col¬ 
umn, and using die validated session to 
retrieve user information. All of these 
things happen in welcome,asp, which is 
presented in Listing Two. In essence, this 
file is a template from which any other 
session-controlled page can easily be built. 
Normally, we would take the session val¬ 
idation script and move it out to an in¬ 
clude file that can he parsed into any page 
dial needs die ability to verify a session. 
Once again, though, well try to keep it 
simple for the purpose of illustrating how 
ASP and ADO work together to handle 
these tasks. The welcome.asp page also 
demonstrates the combination of HTML 
and ASP script dial is the .strength, and die 
weakness, of server-side scripting. The 
strengths lie in the ease with which out¬ 
put can lx* generated and fed back to die 
client, using all of the power of Visual Ba¬ 
sic’s string handling. The weakness is that 
it can be difficult to combine these ele¬ 
ments in a way that leaves a structured 
and readable source file. Fortunately, in 
this case, nearly all die script precedes die 
HTML in the file. 

In this script, as with the login script, 
the first thing to do is certify that the re¬ 
quired parameters are present in die query 
string. In a more complicated application, 
we would probably have to do some val¬ 
idation on the types and ranges of die pa¬ 
rameters as well. Fortunately, all we are 
concerned with here is that the sesskey ar¬ 
gument is present. It doesn’t really matter 
what the type of its value is* One of the 
nice diings about working in a script en¬ 
vironment where all variables are of type 
VARIANT is that you can almost always 
compare what you have against what you 
want, If we want a numeric session key 
and [lie user edits the URL to present a 
siring instead, the result is the same as for 
any other invalid session key, (Of course, 
working widi nothing but VARIANTS caus¬ 
es lots of problems, too, but that's a top¬ 
ic for another time.) Once the script has 
determined that the session key is pre¬ 
sent, it has to validate the session. A ses¬ 
sion key is valid if it is the primary key to 
a record in the session table, and the 
UserlD column of that record is a foreign 
key to a valid record in the user table. 

We could easily wrap these two re¬ 
quirements into a single query, by joining 
the tblJJser and thl_Session tables on the 
UserlD column. We won’t do this because 
it requires diat die data provider support 
joins. The business logic of a real appli¬ 
cation may need to set and enforce this 
requirement, but we prefer to keep the 
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simpler application-state management as 
portable as possible. Before executing the 
statements to validate the session, the script 
creates a database connection exactly as 
in the login script. This is the kind of log¬ 
ic that can be extracted into an include 
file, since the same connection string is of¬ 
ten used throughout an app. It might not 
he a good idea if, for example, die user 
also has database privileges, and you want 
to build the actual user ID and password 
into the string. In tliat case, you’d proba¬ 
bly build the string when the session was 
created and store it. Our example envi¬ 
sions a case where die application and 
database security models are separate. Re¬ 
gardless of where the user's ID and pass¬ 
word come from, there are other param¬ 
eters that you shouldn't liave to retype into 
every page. In practice, we’ve found it 
handy to store information such as the 
datasource name, login ID, and login pass¬ 
word in die system registry on the web 
server or another host on the back-end. A 
simple COM control can extract this in¬ 
formation into script variables, so that the 
pages can be built genetically. 

With the connection in hand, the script 
executes a SELECT to get the UserlD from 
the record in tbl_Ses$ion that is identified 
by die session key passed in the URL. If 
diis record is not found, dien die session 
key is invalid. If die record is found, the 
returned UserlD is extracted from the 
RecordSet object in RS, The next test is for 
a valid user record, The script performs 
this test by selecting out die UserFullName 
column, which will be used later in the 
welcome message. A failure here means an 
error in the integrity of the database: There 
should Lie no records in the session table 
with a UserlD column that is not a valid 
foreign key Into the user table. In fact, if 
we f re using a full-featured database such 
as SQL Server to store die application state, 
the UserlD column of (bI_Session will lie 
constrained so that only valid foreign keys 
can be inserted into it. Such a constraint 
would allow us to select the UserFullName 
column without any fear diat the operation 
could fall due to an invalid user ID. Since 
we are not assuming such a constraint here, 
we check to see if the returned RecordSet 
is empty. If it is, then the script makes use 
of the Response object to write an error 
message to the client, If all goes well, the 
script completes processing and die vari¬ 
able UserFullName Ls used in the body of 
the HTML to extract the user's name into 
the welcome message. This illustrates one 
of die more powerful ways in which ASP 
script integrates with the HTML output 
stream. 

Conclusion 

Active Server Pages are useful for gener¬ 
ating output and managing application 


state on behalf of a client. When com¬ 
bined with Active Data Objects, your script 
can manipulate ODBC dam sources to do 
nearly anything that is possible in native 
SQL, In our applications, we have used 
ASP and ADO to execute stored proce¬ 
dures, create temporary tables, and wrap 
up transactions. Still, it is important to rec¬ 
ognize this model's limitations, which flow 
more from the limitations of scripting lan¬ 
guages than from limits on what ADO can 
do, Scripting languages, whether running 
on the server or client, suffer from the 
problems normally associated with inter¬ 
preted, weakly typed languages. Misspell 
a variable name in VBScript and you've 
created a new variable. If you're lucky, it 
will generate an error. ASP suffers partic¬ 


ularly from die difficulty of mixing script 
and HTML in ways diat retain stincture 
and maintainability. It is also obvious that 
too liberal use of ADO in script will lead 
to a very tight coupling between the ap¬ 
plication logic and the data schema, This 
is acceptable for small applications, but 
not for critical systems. For this reason, we 
have argued here in favor of using ASP 
widi ADO to manage application state and 
glue components together, in this role, they 
serve as one more layer in an architecture 
dial may include dynamic web pages on 
die front end, and mission-critical object 
interfaces on the back end. 

DDJ 

(Listings begin on page 111.) 
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PROGRAMMER'S TOOLCNEST 



The Hot Views Graphics Library 

A graphical user-interface library 
designed for scientific modeling and simulation 

David P. Neddie 


T he Hot Views (Hv) graphical user 
interface library was designed for use 
in scientific modeling and simulation 
applications. It Ls layered on top of the 
ubiquitous UNIX-based X, XL, and Motif li¬ 
braries, Hv will likely be useful to you if: 

• You are developing a model or simula¬ 
tion of real objects that you would like 
to display using a (floating-point) world- 
coordinate system, and/or you need to 
generate scientific/engineering style 
graphs of simulation results, 

• Your application requires pointer feed¬ 
back; when a user points at a repre¬ 
sentation of an object on tine screen, you 
want to provide instant visual feedback 
regarding die object’s state, 

* You want to be able to print the graph¬ 
ical representation of the simulation, or 
render it to a Postscript Me for import 
into another document 

These may sound like trivial require¬ 
ments until you consider that X, Xt t and 
Motif do not provide such services. They 
are pixel based, making representations 
of real objects tedious. Nor do they pro¬ 
vide direct information regarding which 
object contains the pointer Finally, fhey 
do not support the translation of low-level 
X-based drawing into Postscript. 

The world-coordinate systems main¬ 
tained by Hv are a more substantive fea¬ 
ture than you might first suspect. You can 
easily render floating-point coordinates 
onto a pixel-based display by simple scal¬ 
ing. However, Hv will automatically han- 


David is a physicist at the Thomas Jeffer¬ 
son National Accelerator Facility in New¬ 
port News, Virginia, He can be contacted 
at heddte@jlab.org or beddle@jdhtech.com. 


die the arbitrary linear transformations as¬ 
sociated with zooming, .scrolling, and ob¬ 
ject rotation. Furthermore, Hv does not 
indiscriminately transform every object. 
For example, suppose you placed a “print” 
button on top of a map. If you zoomed 
into tile map, Hv would transform the bor¬ 
ders accordingly but the button would 
stay the same size. Other Hv features in¬ 
clude drag-and-drop, integrated plotting, 
zooming, time-step simulations, anima¬ 
tion, drawing tools, double buffering, vir¬ 
tual desktop, online and balloon help, map 
drawing, drawing primitives, scrolling, sim¬ 
plified fonts/colors, 3D sculptured look, 
image menus, and limed redraws, 

Hv’s drawing is optimized to redraw 
the minimum amount of a window that 
needs refreshing. Hv also lets you ear¬ 
mark items for off-screen caching. For ex¬ 
ample, a war game might cache maps, 
which are expensive to draw. With Hv, as 
tokens are dragged over the map, the user 
doesn’t have to wait for the background 
maps to be recomputed and redrawn. 

Hv is comprised of about 60,000 lines 
of ANSI C code. Applications have been 
written in C/C++, it has been tested on 
virtually all flavors of UNIX, including Lin¬ 
ux. The Linux version has been tested with 
commercial Motif libraries as well as the 
recently available and cleverly named 
“lesstif freeware Motif clone.” 

Hv was developed at the Thomas Jef¬ 
ferson National Accelerator Facility, a De¬ 
partment of Energy nuclear physics re¬ 
search laboratory. It is in use at many 
government and commercial sites world¬ 
wide and available via anonymous ftp 
from ftp.cebaf.gov in /pub/heddle/Hv or 
at http://www.cebaf.gov/~heddle/Hv. The 
distribution is replete with demos and a 
comprehensive programming manual. 


The Hv Paradigm 

An Hv application is contained in a sin¬ 
gle X window, called the “main window," 
with a single menubar, not unlike a Mac¬ 
intosh desktop. Within the main window 
are additional window-like objects called 
“views." Views might contain independent 
simulation displays or may represent dif¬ 
ferent perspectives of a single model. Fig¬ 
ure I is an example of an Hv main win¬ 
dow containing multiple views. This 
structure was chosen over independent X 
windows for each view to avoid having 
to hunt around a cluttered screen for a 
lost window . 

Tile views beliave as you would expect. 
They can be moved, resized, dosed, 
scrolled, and exploded, within the con¬ 
fines of tiie main window. However, un¬ 
like most windows, the views can be ar¬ 
bitrarily zoomed by rubber-banding a 
rectangle while the middle mouse button 
is pressed. 

Contained in each Hv view are Hv 
items. These include Hv versions of com¬ 
mon control widgets such as .sliders, but¬ 
tons, rainbow scales, and wheels. There 
is also a suite of world-coordinate-based 
items, such as points, lines, rectangles, 
ovals, text, and polygons. More impor¬ 
tantly, views contain user-defined items 
that represent real-world objects or 
graphs. Typically, developers only have 
to provide the code that draws the item 
(using die Hv drawing primitives), Hv will 
take care of the item maintenance and 
Postscript rendering. 

Hv provides many items that already 
exist as Motif widgets, including buttons 
and other control widgets. This was done 
partially to provide a common interface; 
you do not use a Motif API for some ob¬ 
jects and an Hv interface for others. Tills 
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(continued from page 92) 
also supports widget sets other than Mo¬ 
tif, The widespread acceptance of Motif 
and die fact it is now bundled on all UNIX 
workstations has allayed this concern. Still, 
the minimization of the use of Motif may 
prove beneficial in porting Hv to non- 
UNIX platforms such as Windows 95. 

To summarize, an Hv application has a 
single main window that confines an ar- 
bitrary number of Hv views. Each Hv view 
is comprised of multiple I Iv items. The 
items come in two basic llavors — control 
widgets and user-defined representations 
of real objects. 

Hv Views 

An Hv view is comprised of three dis¬ 
tinct areas: 

• The HotRect is the canvas where ihe 
representation of the simulation will lie 
displayed, 

* The ConLrol area is where application 
state is maintained and control items are 
typically displayed, 

■ The Feedback region is where live up¬ 
dates asstx'tated with tracking the point¬ 
er are reported. 

Figure 2 is a typical view arrangement 
which designates the three areas. Figure 
2 also includes a familiar set of controls 
for dragging, resizing, exploding, hiding, 
and scrolling an Hv view, 

Hv Programming 

A preliminary design should lie carried out 
prior Lo the development of an Hv appli¬ 
cation. This involves deciding how many 
different types of views are required and 
then deciding (for each view type) what 
control items are needed, what HotRect 
(usually user-defined application-specific) 
items are needed, and what manner of 
pointer tracking (feedback) would be in- 
fonnative. 

The program hvmap in Figure 3 (in¬ 
cluded as a demo in die Hv distribution) 
displays political maps on either a Mer- 



Figure 1: Multiple Hv views displayed 
on the Hv main window. 


cator or a globe-like orthographic projec¬ 
tion. As part of a preliminary' design phase, 
1 decided to use just one view type, pro¬ 
viding controls so the user can toggle be¬ 
tween the two projections. Additional con¬ 
trols allow users to toggle the display of 
a latitude-longitude grid and a GPU- 
expensive rendering of elevation data. 

The alternative was to define two dis¬ 
tinct view types, one for each projection. 
I rejected this as unnecessarily complicat¬ 
ed. One view type does not mean there 
can be only one view. Any Hv view can 
be indefinitely cloned: 1 can have any 
number of hvmap views, each a carbon 
copy in regard to the controls and feed¬ 
back, but independently positioned, sized, 
and zoomed, some showing Mercator pro¬ 
jections and others orthographic. This is 
a common tradeoff in an Hv application— 
to choose between defining a new view 
type or adding controls (usually radio but¬ 
tons) on a base that selects different ren¬ 
derings. 

The main program in an Hv application 
generally consists of three lines: one to 
initialize 1 Iv, one lo call the application's 
private initialization, and a final call to en¬ 
ter an event dispatching loop, hvmap's 
main program is presented in listing One 
(listings begin on page 112). 

The first call handles the Hv and X/Mo- 
tif initialization. Hv_ValnttiaIize, like many 
I lv routines for creating views, items, and 
other objects, uses NULL-terminated, 
variable-length argument lists, the bulk 
of which are keyword-value pairs ini¬ 
tializing an attribute of the object (or, in 
this case, process). Variable-length rou¬ 
tines all begin with the HvJ/a prefix. Hv 
initialization performs (transparently) all 
the underlying X and Motif initialization, 
creates the main window, allocates fonts 
and colors, and creates the menubar. 
HvjGo implements an event loop where 
X events such as pointer motion, point¬ 
er clicks, and keystrokes are handled. 
You don’t need to knew any of the de¬ 
tails of the Hv initialization, or how Hv 
processes X events, 

Variable-Length Argument Lists 

The use of variable-length argument lists 
was indispensable in making Hv pro¬ 
gramming tractable. The alternative was 
complicated creation signatures with 
many parameters or a slew of attribute- 
setting routines called after an item or 
view was instantiated. Since this tech¬ 
nique may be of general use for any pro¬ 
gram, I will take a momentary detour to 
explain how variable-length argument 
lists were used in FIv. 

A variable-length argument list in Hv 
looks like Hv_VaX(r ) t ,.., rn ? . un , 

NULL), where rl through rn are required, 
v 1 through vn are optional, and NULL ter¬ 


minates the argument list. At least one re¬ 
quired argument must be provided; you 
cannot write an ANSI C procedure that 
has only optional arguments. (However, 
ANSI C doesn’t require NULL termination; 
you could have one of the required ar¬ 
guments describe the optional arguments 
that follow. This is how print/ and relat¬ 
ed procedures work in C) 

For example, the procedure for creat¬ 
ing Hv items has one required argument, 
die view (of type Hv_View) where the 
item will live. The prototype for the func¬ 
tion is llvjtem Hv_VaCreateltem(Hv_View 
View , .. .X The ellipsis is not representa¬ 
tive—you must actually type three peri¬ 
ods. You will probably have to include 
the stdarg.h header file (or the older, non- 
ANSI sys/varargs.h header file) to use 
variable-length argument lists. The top of 
this procedure declares vajist ap f then 
calls va_start(ap ! View) to initialize ap 
for handling the variable arguments. View 
is the last required variable here. The 
vajist and ua_siart symbols are part of 
Standard C. Once ap is initialized, you 
can use it as any other variable. In par¬ 
ticular, HvJvaCreateltem passes it to 
HvjGetAttributes, which takes the vajist 
variable and processes it, stuffing the re¬ 
sults into an array, which the remaining 
code in Hv_VaCreateltem uses to initial¬ 
ize the item as specified. 

Tile only remaining issues involve how 
Hv_GetA tributes processes the vajist. Ba¬ 
sically, ii pulls off one argument and 
checks if it is the NULL terminator. If it Is 
not, it assumes it is a keyword and pulls 
off the next argument as the correspond¬ 
ing value. It places this in a union and in¬ 
terprets Lhe value based on die data type 
associated with the keyword. Listing Two 
is the code for HvjGetAttributes. 

Designing an Hv Application 

When developing an Hv application, the 
bulk of your weak goes into die private, 
application-specific initialization routine 
Inii(). Here is where an application’s 
initial views are created along with the 
controls, other standard and user- 
defined items, and entries into the feed¬ 
back area. Any routine initialization 
(such as initializing global variables) and 
menubar additions or modifications are 
also done here. Listing Three presents 
the application-specific Init() procedure 
for hvmap. 

For example, the WindowTile and 
Add Logo procedures use Hv primitives for 
tiling tlie main window' widi a propagan¬ 
da graphic and drawing a corporate logo 
on a special welcome view present in 
many Hv applications. Ini (Controls mere¬ 
ly initializes some global variables. The 
LnitQuickZoom procedure sets up prede¬ 
fined zooms for hvmap, available via a 
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Figure J Two views from the program hvmap (one of the demos in the Hv 
distribution)> 



Figure 2: Scientific plot view, which is 
available in any Hv application. 

mo use-button-controlled pop-up menu. 
These details are best left for you to un¬ 
cover when building and examining the 
demos at length. All of these procedures 
are Optional- 

Listing Four b the MakeMenus proce¬ 
dure, which modifies the default menus 
provided to all Hv applications. This cre¬ 
ates, via variable-length aigument calls to 
Hv_Va CreateMen ultem , three menu items 
and attaches a callback (to be invoked 
when the user makes that selection) to 
each. These particular items are attached 
to predefined menus—Help and Action— 
present in all Hv applications. You can 
also create an entirely new menu and at¬ 
tach items to it. 

Listing Five shows the Finallnit func¬ 
tion, Hv_AddPath adds a search path that 
Hv will automatically scan for data files. 
Hv_InitMaps initializes the maps. (The 
map initialization is an Hv call, Hv_ 
InitMaps, because support for drawing 
maps is built into Hv r not in the hvmap 
demo,) The interesting line is the 
New View line, which creates an initial 
map view. 'Fhis procedure is too long to 
present here, but the most important part 
of it is the embedded call to HvJ/aCre - 
ateView, which creates the actual view; 
see Listing Six. 

For example, the {HvJNtTLAUZE, setup) 
pair in Listing Six tells Hv to call a private 
initialization routine after the view is in¬ 
stantiated. In that routine (ViewSetup, in 
this case), the applications will attach the 
items (as well as the feedback instructions) 
to the view. Listing Seven creates the ra¬ 
dio buttons in Figure 3 that let users tog¬ 
gle between projections. It actually cre¬ 
ates a complete set of radio buttons. If a 
new projection is supported, you only 
need to add another pair (HvJDPTlON, 
"NeivProjection ") to the argument list. 


Conclusion 

When you create views and items, you 
attach callback procedures that are in¬ 
voked in response to various actions. For 
example, you can provide a feedback 
procedure for a view that Hv will call 
when the pointer moves within the 
view’s HotRect. You have to convert the 
pointer position provided by Hv and cre¬ 
ate a meaningful string for Hv to display 
in the appropriate position in the feed¬ 
back area. 

There are two ways to create user- 
defined items. One way is to create tru¬ 
ly user-defined items that Hv knows 
nothing about. This approach requires 
you to provide a procedure for drawing 
the Item as well as support procedures 
that tell Hv how die object should re¬ 
spond to scrolling, resizing, rotation, and 
other events. Ifs usually better to pig¬ 
gyback on an existing Hv item. For ex¬ 
ample, many items can often be repre¬ 
sented by world-based polygons — those 
with vertexes stored as world, rather 
than pixel, locations. Hv has a prede¬ 
fined world-based polygon, so creating 
an item based on it is advantageous be¬ 
cause Hv already knows how to handle 
at 1 the basic drawing and item mainte¬ 
nance. You need only provide a cus¬ 
tomized drawing procedure that Hv calls 
after (or in lieu of) its basic polygon 
drawing, effectively converting the item 
from generic to specific. 


Since hvmap only displays maps, and 
maps are built into Hv, it has no need to 
create new items. A more serious Hv ap¬ 
plication is ced, which is used to display 
data in a massive (six-meter diameter) nu¬ 
clear physics detector. One component of 
the detector is a device called a “drift 
chamber.” Two drift chamber items are 
visible in Figure 1. They are in die top¬ 
most view, sitting on a multicolored back¬ 
ground and are tiled with hexagons, some 
of which are filled. The code that creates 
one of these drift chamber items is pro¬ 
vided in Listing Eight. 

This is an example of the piggyback tech¬ 
nique. The item is created not as a user 
item (witli HvJYPE of HvJLEERJTEM), but 
as a world polygon, from the predefined 
suite of Hv items. After instantiation, die 
type is changed to a user item, and the 
drawing is redirected to a user-provided 
drawing routine. 

An alternative is to use the Hv_US- 
ERDRAW attribute to specify a drawing 
routine. With this approach, Hv will first 
call its built-in world polygon drawer, then 
call the user provided routine. By over¬ 
riding the standarddraw pointer (as in 
Listing Eight), the default world polygon 
drawer is never called. 


Dp ) 

(Listings begin on page 112,) 
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Profile-Guided Optimizations 

Optimizing object code 
using run-time information 

Gary Carleton, Knud Kirkegaard, and David Sehr 


B etween 1992 and 1993, our compil¬ 
er group at Intel was involved in de¬ 
veloping optimizations for a new su¬ 
perscalar processor. We found these 
optimizationsinstruction scheduling 
and code generation to avoid processor 
stalls—effective for many applications. 
However, we encountered some apps that 
didn't achieve the expected performance 
improvements. These programs seemed 
to have two common characteristics. First, 
they were large applications consisting 
of hundreds to thousands of functions, 
resulting in very large executable pro¬ 
grams. Second T they spent little execu¬ 
tion time in loops; instead, they actively 
exercised large portions of the executable 
code. Much of the application-execution 
time was spent in branches and function 
calls and returns. 

One outgrowth of our experiences is 
our implementation of profile-guided op¬ 
timizations. Profile-guided optimizations 
work by feeding information about how 
a program executes back to the compil¬ 
er. This allows the compiler to focus its 
efforts more effectively on regions of pro¬ 
grams that matter for execution time. In 
the case of die problem applications, pro¬ 
file information also enables the compil¬ 
er to make the best use of the hardware 
resources available on the processor. First, 
it improves the performance of the in¬ 
struction cache and paging mechanisms. 
It does this by helping die compiler to en¬ 
hance program locality; that is, placing fre¬ 
quently executed code together, while 
moving infrequently executed code away. 
Second, profile information enables the 
compiler to improve die effectiveness of 
branch prediction hardware by rearrang¬ 
ing branches. 


The authors are engineers for Intel Tiny 
can he contacted at gary_carteton@ccm 
'Sc.intel.com. 


The profile-guided optimization tech¬ 
niques we develojied have been incorpo¬ 
rated in the Intel C/C++ Compiler Plug-in 
(http://developer, int el. com/ d rgf pentiumll/ 
appnotes/compiler.htm), which can be in¬ 
tegrated into Microsoft’s Developer Studio. 
Tli is makes it possible for you to use inline- 
assembly instructions that are currently not 
supported by Visual C++ 5.0. The Intel 
C/C++ Compiler Plug-in is compatible with 
Visual C++ 4.x or later compilers when it 
comes to command-line switches, inline- 
assembly format, object modules, library 
and DLL formats, debug and C++ symbol 
formats. Other optimizations provided by 
the plug-in (which are not currently avail¬ 
able wilh Visual C++) include a rounding 
control option that optimizes floating point 
to integer conversions, 

In this article, we discuss profile-guided 
optimizations—how the compiler collects 
run-time information—and present sev¬ 
eral of the optimizations performed by 
profile-guided compilers. 

Figure 1 is a program fragment that il¬ 
lustrates a number of optimizations dis¬ 
cussed here. The fragment contains a white 
loop enclosing an if statement. The shad¬ 
ed regions in the code represent basic 
blocks, the representation used by profil¬ 
ing. A basic block is a sequence of code 
in which either all or none of the code 
executes. Usually, a basic block ends with 
a branch, call, or return. Figure 2 illus¬ 
trates the control-flow graph built from 
those basic blocks. Edges in the graph 
show how the flow of control can trans¬ 
fer between basic blocks, and each edge 
is latieled with the number of times con¬ 
trol transfers through it. 

Give the Compiler More Information 

Most compilers make optimization deci¬ 
sions based on static heuristics. Notably, 
a compiler may decide whether to inline 
a function based on either its size or 


whether it is called from within a loop. 
Profile guidance enables the compiler to 
make better optimization decisions by us¬ 
ing knowledge about how the program 
actually runs. With this profile informa¬ 
tion, the compiler can do a better job of 
inlining functions, ordering basic blocks, 
allocating registers, and so on. 

Three-Step Build Process 

In contrast to a traditional compiler that 
uses a single compilation to optimize an 
application, profile-guided compilation re¬ 
quires three phases: 

• The first phase is the instrumented com¬ 
pilation. Instrumented compilation typ¬ 
ically produces an executable program 
that contains probes in each of the ba¬ 
sic blocks of the program. Each probe 
counts the number of times a basic 
block executes. If die block is a branch, 
the probe records the direction taken 
by that branch. 

• The second phase is the profiled exe¬ 
cution, When lain, the instrumented pro¬ 
gram generates a data file that contains 
die execution counts for die specific run 
of the program. 

• In the third phase, information from the 
profiled execution (or executions) of the 
program is fed back to the compiler. 
This data Ls added to the compiler’s con¬ 
trol flow graph. 

Basic Block Ordering 

Without profile feedback, the basic blocks 
of a function are usually ordered simply 
to eliminate branches to other branches. 
The blocks in an if-then-else construct, for 
example, may be ordered according to 
some heuristic to try to improve branch- 
predictor effectiveness, but, in general, 
blocks are placed according to simple syn¬ 
tactic rules. For example, Figure 3(a) 
show's how a conventional compiler might 
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Figure 1: Sample program fragment. 
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—► = Frequently Executed 
—► = Infrequently Executed 


Figure i* (a) Conventional basic 
block ordering; (b) improved basic 
block ordering , 


(continued from page 98) 
order tire basic blocks in the executable 
for the example in Figure 1. The edges 
outside the layout show control transfers 
between blocks that are not adjacent* Ba¬ 
sic block ordering rules often fail if the 
condition of an if statement is poorly pre¬ 
dicted or predominantly takes the direc¬ 
tion opposite of the heuristic* This results 
in three performance penalties that are 
addressed by profile-guided block layout. 

First, many processors employ a first¬ 
time branch-prediction heuristic (for in¬ 
stance, “forward branches not taken, back¬ 
ward taken"). If a branch in a program 
does not conform well to this heuristic, 
then it is possible that the first-time pre¬ 
diction may be wrong most of the Lime. 
In the layout given in Figure 3(a), this pre¬ 
diction will be wrong because the first it¬ 
eration of the loop exercises the code in 
block F rather than block 1). This is rela¬ 
tively unimportant when the program is 
small and the branch executes a large 
number of times. 

If the program is small enough, the pro- 
cessor can remember a prediction for each 
branch in the program, using the branch's 
history to predict its future behavior How¬ 
ever, for large applications, this may not be 
possible since there may be too many 
branches for the branch predictor to re¬ 
member. If tills happens, then static pre¬ 
dictions will again be used for the branch. 
Using profile-guided block ordering, a com¬ 
piler is able to .select branch conditions to 
make better use of static branch prediction 
heuristics. Figure 3(b) shows a layout that 
makes better use of static prediction. 

Second, compilers usually lay out blocks 
according to a static heuristic to reduce the 
number of branches along some path, say 
the then part of an if-then-else; path C-D- 
F in Figure 3(a), This choice requires an 
additional jump in the else part (block F). 
If tine else part is executed more frequent¬ 
ly, more jump instructions are executed. 
From the profiling information, the jump 
will be executed 90 times if the blocks are 
laid out as in Figure 3(a), A profile-guided 
compiler attempts to place blocks to min¬ 
imize the number of jumps along the mast 
frequently executed path. With profiling in¬ 
formation, the compiler will note that the 
path from D to F executes less frequently 
than the path from E to F, and will place 
the code as in Figure 3(b). This results in 
an unconditional jump at the end of block 
D that executes only ten times. 

Third, some blocks of a function—often 
error handling and recovery code—are 
seldom (or never) executed. Compilers of¬ 
ten place such seldom-executed bbcks in 
the middle of a function, filling the in¬ 
struction cache and virtual memory paging 
system with code that is never executed. 
Tills is probably not significant for small 


programs, but for large applications, it’s bet¬ 
ter to move infrequently executed code out 
of line to reduce the amount of code that 
is paged in or fetched into instruction cache. 
Profile-guided compilers can determine that 
such code is infrequently executed and 
move blocks out of the main body of the 
function, typically to the end of the func¬ 
tion. This results in less instruction cache 
pollution and, to a lesser extent, fewer pages 
brought in during a run. 

Profile-guided compilers may also be ca¬ 
pable of moving infrequently executed 
blocks out of the code containing the func¬ 
tion, and placing them at the end of the en¬ 
tire application. This function splitting can 
result in significant improvements in pag¬ 
ing behavior, while retaining the function¬ 
ality contained in the infrequently execut¬ 
ed code. This optimization has a number 
of interesting implications in compiler de¬ 
sign, such as how to compute address sizes. 

Register Allocation 

One of the most powerful optimizations 
performed by a compiler is register allo¬ 
cation, A register allocator attempts to as¬ 
sign registers to variables in a way that 
minimizes the number of times variables 
are loaded from memory. Because it is 
common to have more variables than 
available processor registers, a register al¬ 
locator must decide which variables to 
keep in registers, in which portions of a 
function, and which variables to reference 
from memory. 

Typically, the register allocator uses a 
static heuristic, preferring to keep vari¬ 
ables used in loops in registers, for ex¬ 
ample. This results in register allocation 
that is locally optimized within the loop, 
but it may not produce the best total per¬ 
formance. 

For example, in Figure 1, if block D con¬ 
tains a computation that uses a large num¬ 
ber of registers, then the compiler may not 
keep x in a register. If there were variables 
that were live across the loop, the com¬ 
piler mighL move register stores to mem¬ 
ory (spills) and reloads outside the loop. 
If tiie loop has lew iterations, this decision 
may be worse than placing die spills and 
reloads inside the loop. Furthermore, in 
die example, the loads and stores required 
to perform the increment in block E may 
be expensive. Hence, keeping x in a reg¬ 
ister, spilling it on entry to block D, and 
restoring on exit from that block results in 
much better code and ten bads and stores 
of x rather than 90, Profile-guided com¬ 
pilers are able to use execution frequen¬ 
cy to select points for spills and reloads 
that are infrequently executed. This allows 
the register allocator to use registers for 
variables in regions that are heavily exe¬ 
cuted, and avoid spilling variables around 
infrequently executed regions. 
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void A (...) C 

InlinedProc (...Is 

}*** 

void InlinedProc (...) [ 
if (condition] 

// Code that is frequently executed when called by A 

else 

H Code frequently executed when called by others 


Example I: A case where (filming before profiling is advantageous. 


Function Inlining 

Function inlining is another optimization 
that benefits from profiling. Function in¬ 
lining substitutes Lhe code associated with 
a procedure call (call site) with the actu¬ 
al code for the function allied {callee). In¬ 
lining removes the call overhead (setting 
up parameters, and so on) and makes it 
possible to optimize die callee in the con¬ 
text in which it is called. 

Profile information provides execution 
counts of the call sites, thus enabling the 
function in liner to focus on die important 
call sites within the program. Function in- 
lining without profile information may in¬ 
line functions at call sites that are not im¬ 
portant for performance, thereby increasing 
code size unnecessarily. It is important to 
control this code-size growth when inlin¬ 
ing because it may affect instruction cache 
misses. 

It is passible to profile before in lining, 
and vice versa. Each approach has its ad¬ 
vantages, For example, die program in Ex¬ 
ample 1 illustrates a case where inlining 
before profiling is advantageous. In Ex¬ 
ample 1, if profiling occurs before in lin¬ 
ing, then the profiling information for In¬ 
linedProc will reflect its behavior over all 
sites where ii is called. Suppose a condi¬ 
tion usually evaluates to False, Profile- 
guided optimization would arrange the 
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branch so dial the False block comes be¬ 
fore die True block. However, if a condi¬ 
tion is usually True when InlinedProc is 
called from A, then this results in exactly 
the wrong optimization for a copy of In¬ 
linedProc inlined into A. 

On the other hand, Example 2 illustrates 
that it is difficult to do inlining before pro¬ 
filing, because recursive functions cannot 
be completely inlined. Furthermore, inlin¬ 
ing all function calls, even in nonrecursive 
functions, may result in an executable pro¬ 
gram so large that it is not practical to run. 

In our compiler implementation, we 
profile before making any inline decisions. 
To date, we have not observed cases 
where the behavior of a function differs 
dramatically based on its caller, which 
seems to support this choice. 


Function Layout 

Function layout is a form of code placement 
similar to basic block reordering, Function 
layout is not intended to reduce inaccurate 
branch predictions, but to improve instruc¬ 
tion paging and instruction cache misses. 
Thus is achieved by placing functions with 
frequent calls between them close to each 
other in the executable code, making it 
more likely that the functions will be on the 
same page or on pages that can be kept in 
main memory simultaneously. For small 
functions, this also helps keep the functions 
in tine instruction cache simultaneously. 

A call graph shows die calling relation¬ 
ships between functions. There is one 
node for each function, and a line be¬ 
tween a function A and a function B if A 
calls B. The edges have a weight that is 
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the number of times A calls B in the in¬ 
strumented run. Figure 4 shows a call 
graph that has the edges annotated with 
the call weights. 

Function layout is a complicated prob¬ 
lem and any reasonably fast solution to the 
problem is based on a heuristic, in our 
compiler, we have implemented three dif¬ 
ferent algorithms for function layout. Two 
of them are based on a depth-first walk of 
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Figure 4: Call graph. 



Figure 5; Source order ; 


the call graph, and the other is based on 
Pettis and Hansen’s graph-collapse method 
(see ‘‘Profile Guided Code Positioning, 11 by 
Karl Pettis and Robert C, Hansen, Pro¬ 
ceedings of the ACM SIGPLAN '90 ). 

Our descriptions of the three function 
layout algorithms use die call graph in Fig¬ 
ure 4. Normally, the compiler places func¬ 
tions in the object file in lexical order, that 
is, the order in w hich they appear in the 
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Figure 6: DFO order 
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Figure 7: HCC order 


source file. Figure 5 shows the source or¬ 
der for our example. 

The simplest function layout method is 
called depth-first ordering (DFO). Starting 
at the top of the call graph, the functions 
are ordered by doing a depth-first w r alk of 
the call graph. If profiling information is 
available, the outgoing edges from a func¬ 
tion are walked from most to least frequent. 
If no profiling information is available, the 
outgoing edges are walked in random or¬ 
der. Even without profile information, this 
does a better job in most cases than just 
ordering die functions in source order. 

Figure 6 is the DFO layout of our cal) 
graph in Figure 4. Starting at the root, func¬ 
tion A is first placed. Following the high¬ 
est call weight, function C is placed next. 
Continuing down the call graph, E and 
then D are placed. The DFO walk Lhen 
returns to A, then places function B. The 
layout is now complete since function D 
has already been placed. 

The second method is highest-count 
caller (HCC), This method is also based 


void B (,.,■) [ 

B(. ■■} 
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Example 2; ft is difficult to do 
inlining before profiling. 
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on a depth- first walk of the dynamic call 
graph 3 starting at the top. However, be¬ 
fore the walk, only the incoming call edge 
with the highest execution count is kept 
for each function. This way, a function 
will be placed closest to the function that 
calls it most frequently. Figure 7 shows 
the resulting function layout. 

The last method is closest is best 
(CIB). The idea is to always pick the call 
edge in the dynamic call graph with the 
highest execution count remaining in 
the call graph. If there are two edges 
with the same weight, one is chosen at 
random. For each edge selected, the 
caller and callee nodes are merged into 
one node, and the call edges are up¬ 
dated accordingly. Two merged func¬ 
tion nodes in the call graph indicate that 
the functions are placed next to each 
other. This process continues until the 
call graph is reduced to one node. When 
a node is merged into an already merged 
node, the function is placed closest to 
the function with the most immediate 
call relation. Figure 8 shows the result 
of CIB on our call graph. 

Using function layout, we have mea¬ 
sured up to 80 percent improvement in In¬ 
struction Translation Lookaside Buffer miss¬ 
es (ITLB is an on-chip cache that saves 
paging information to increase paging per¬ 
formance). On small programs, however, 
this may not mean a large performance 
improvement, since small applications nor¬ 
mally are not limited by paging and in¬ 
struction cache performance. On very kuge 
applications however, these effects may 
result in a measurable performance im¬ 
pact, On some larger applications, we have 
seen Lw r o to four percent improvements in 
instruction cache misses. 

Conclusion 

Profile-guided optimizations provide nu¬ 
merous opportunities to improve perfor¬ 
mance, only some of which have been dis¬ 
cussed here. For instance, to improve the 
performance of C++ programs, profile- 
guided optimizations must consider indi¬ 
rect function calls. C++ virtual function in¬ 
vocation can be optimized significantly by 
profiling not just the frequency of calls but 
also their targets, This enables the simple 
optimization of replacing the indirect [unc¬ 
tion invocation by an if test for the pre¬ 


dominant type, using the indirect call only 
when that test fails. Currently, our compil¬ 
er does not profile the targets of indirect 
function calls, but we consider this the most 
significant extension yet to be added. In 
addition to reducing the number of indi¬ 
rect calls, this optimization also enables 
better function placement, better function 
inlining, and better compiler analysis. 
One of the challenges for profile- 
guided optimizations is to achieve wide 
acceptance and usage by software ven¬ 
dors. The more-complicated build pro¬ 
cess required by application developers 
is often an impediment to wide accep¬ 
tance. Furthermore, the profiled run of 
die program typically takes significantly 
longer than an optimized run, and the 


application developer may not know 
what constitutes a representative data set. 
The benefits of the additional overhead 
in first doing an instrumented compila¬ 
tion, then running the application with 
representative data input sets, and final¬ 
ly compiling with feedback of the pro¬ 
file information, must be understood and 
appreciated before profile-guided opti¬ 
mizations will become a part of the nor¬ 
mal application build process, However, 
for large applications that are executed 
frequently, profile-guided optimizations 
may prove to be a significant source of 
performance. 
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int &var_eelector(int n) 
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int var_l = 
int var_2 *= 


1? 

2; 


iff n 1 ) return var 
else return var_2; 


int main() 
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int &x = var_selector(1); 
cout « "value = "; 
cout « x j 
return 1; 
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Customizing 

DDX/DDV 


Use this trick with finite 
sets of scripting elements 

Jean-Denis Bertron 

■ recently had to create a system that 
would let users put scripts together by 
picking programmatic elements and 
placing them in a list. I decided to pro¬ 
vide users with an interface similar to that 
of installation systems, as a front end to 
a CGM generator library, (CGM, short for 
“Computer Graphics Metafile," is an ISO 
standard for 2D computer graphics stor¬ 
age and exchange of images. If you are 
familiar with the TIFF, WMF, or VRML for¬ 
mats, you know what I T m talking about.) 
The technique I developed uses custom 
data exchange routines to add macro pro¬ 
cessing to Windows dialog boxes. You can 
use this interface with installation systems, 
serial-communication packages, or just 
about any environment where the set of 
scripting elements is finite. 

Since all scripting elements require pa¬ 
rameters, I decided to create a C++ class 
derived from CDialog for each. By asso¬ 
ciating a dialog resource to each class, this 
design eliminates the need to store pa¬ 
rameter data elsewhere. Thus, die class 
for a typical dement defining a circle is 
derived from CMFElem, a generic script¬ 
ing item class, which is itself derived from 
CDialog. CiassWizard does not, however, 
support custom classes when creating a 
class for a new dialog resource. Never¬ 
theless, this was a minor inconvenience, 


Jean-Denis is a software engineer for Hen¬ 
derson Software. Fie can be reached via 
e-mail atjdBcgm.com. 


since I derived dialog classes from CDia¬ 
log and merely substituted CDialog in the 
code with the name of my base class, CM- 
FElem. Figure 1 illustrates the class hier¬ 
archy and main attributes and services for 
the circle element. The generic CMFElem 
class factors in some of the scripting item’s 
common attributes, such as the com¬ 
mented and indented states. By default, 
CiassWizard does not create serializable 
classes. 


The circle element has center and ra¬ 
dius attributes that are associated with 
three edit controls on the dialog resource; 



see Figure 2. CiassWizard generated List¬ 
ing One (listings begin on page 137) for 
the parameters in this class after I used it 
to create data members for the controls. 

In CGM, parameters such as coordinates 
can be integers or reals. This is why 1 
chose CString for the parameter types. 
However, storing only constant data in the 
elements was too restrictive. Users would 
ask to change the value of parameters at 
run time to generate, for example, con¬ 
centric circles. As it turns out, one solu¬ 
tion to tills problem is to create custom 


Dialog Data Exchange and Validation 
(DDX and DDV, respectively) functions, 
DDX exchanges data between dialog con¬ 
trols and their corresponding member vari¬ 
ables in a dialog class; DDV validates this 
data. 

Creating Custom DDX/DDV Functions 

Microsoft technical note TN026 describes 
how to create custom DDX/DDV func¬ 
tions and add GassWizard support for data 
types using die routines. The MFC sam¬ 
ples include a program called “CHK- 
BOOK” to illustrate die technique. Using 
this method, you can associate a custom 
type to any control on a dialog resource. 
You must first create a new data type. 
Next, you must create an exchange func¬ 
tion to access the control’s attribute and 
till die data member. When data is going 
the other way, the function will update 
tlie control’s state using die data member. 
Last, you must add a few lines to Class- 
Wizard’s configuration file in your project 
directory or in Visual C++’s directory. You 
can also implement a DDV routine for data 
validation, bur this is often inadequate be¬ 
cause the function must take exactly one 
additional parameter, 

I chase to implement a class to store an 
expression as a string, validate, and re¬ 
solve it when necessary. The classes 
(Macro and CParser (available electron¬ 
ically; see "Resource Center,” page 3) do 
this, CParser implements a top-down pars¬ 
er I prototyped using Leopurd (see "Build¬ 
ing Parsers with Leopurd," by Thor Mir- 
chandani, DDJ f March 1996). CM aero, a 
storage class similar to CString, offers two 
main services: CMacro can parse or re¬ 
solve the expression it stores, The class 
uses a CParser object to perform these 
operations. I chose to separate the ex¬ 
pression storage from die parser for con¬ 
venience because the parser’s grammar 
would evolve. You could combine both 
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classes into one, but consider the impact 
on serialization: Will a small change, such 
as adding a flag, to the parser code affect 
the macro object's serialization? 

Copying die expression from the macro 
to the parser's internal buffer causes some 
overhead and extra memory allocation. 
The grammar I used is simple, providing 
basic mathematical operators; see Exam¬ 
ple 1. It is fragmented because Leopurd 
cannot process left-recursive productions. 
Variables are prefixed with an ampersand 
when referenced. For instance, the pars¬ 
er will validate the expression lg(8tbits) 
+&sign , where Sahits and &sign are refer¬ 
ences to global expressions. I will discuss 
these later. CParser shows die implemen¬ 
tations of the DDX routine and the seri¬ 
alization and assignment operators asso¬ 
ciated with the CMacro type* 

By adding the two lines in Example 2 
in my project's CLW file under the [Gen- 


macro : expr EOI; 

expr : a trot Zexpri 

Zexpr i EMPTY ! 

' + 1 expr | 
expr ! 

OR expr j 
XOR expr : 

stmt : term Zstrat; 

Zb trot j EMPTY | 

term | 
tern | 

*%' term 1 

AND term ; 

term i number ' 

expr ')' i 

LOG '{' expr ')' 1 

LN 'V expr ’)< j 
LG expr T ) f 1 

EXP 1 (' expr »)' | 

ABS '(■ expr ■)■ [ 

SIM 1 ( 1 expr | 

COS 1 C 1 expr ')' \ 

TAN '( 1 expr ') 1 ! 

ASIN t {‘ expr »)' ! 

ACOS *(' expr ')' ! 

ATAN 1 ( 1 expr ')' | 

NOT »(■ expr O’ j 
DIV 1 C 1 expr * r T expr 1 )' 

CNF ' (’ expr V. 1 expr 

i 

FQW *(* expr ',* expr *}* 

AMP * £ * expr ', r expr r )* 

number : KEXVAL ! 

afloat ! 
i +' afloat S 
1 - 1 afloat 1 
NAME ; 

afloat : INTEGER decimal exponent 

decimal : Wm j V INTEGER ; 

exponent : EMPTY 3 

■E' Eexp: 

Eexp : INTEGER \ 

INTEGER 3 
1 + 1 INTEGER : 


Example 1: Grammar used to 
provide basic mathematical operators * 


eral Info] Section, 1 enabled ClassWizaxd's 
support for a CMacro type. 1 could now 
use a CMacro object to access an edit con¬ 
trol, listbox, or combo box. For all exist¬ 
ing classes with data members linked to 
controls in a resource, 1 had to remove 
the data member and add it back as a 
CMacro. For class ECCirde , the new data 
members coded by ClassWizard look like 
Listing Two. Because all my dialog- 
derived classes are serializable, 1 had al¬ 
ready written code for ECCirde to store 
or load the data members as in Listing 
Three. 

I needed to write code to serialize a 
CMacro object. To avoid changing this 
code, 1 decided to create the overloaded 
« and » operators for the type. Both 
operators merely call the CMacro Serial¬ 
ize member function, which does all the 
work* These functions are located at the 
bottom of the listing for CParser (avail¬ 
able electronical ly). 

Global Variables 

To give expressions access to global vari¬ 
ables, I had to add an extra parameter to 
die two resolution functions in CMacro. It 
is a pointer to die document, because in 
my application, the document Ls the place 
where elements are resolved. This pointer 
to the parent document is stored in the 
CParser class during die resolution. Clear¬ 
ly, this is preferable to passing the pointer 
to all functions involved in parsing, which 


would waste stack space, a precious re¬ 
source in top-down parsers. It is safe to 
save a pointer to die parent document since 
it is a C++ object, not a window. 

The validation routine does not need 
such a pointer, unless you want to vali¬ 
date the existence of a variable without 
resolving its value, I added die extra pa¬ 
rameter to it, although 1 do not use the 
feature. The function CParser:. number^ 
actioni) uses the pointer to call the doc¬ 
ument's FindByName member to locale 
and evaluate a global variable. 

1 used a collection of CMacro types to 
store global variables at the document 
level. This has the advantage of allow¬ 
ing code reuse. Also, users only have to 
learn one macro grammar. In addition, 
CMacro is a great candidate for being 
stored in a CObArray collection, since 
its parent class is CObject Finally, the 
resolution of global variables can stay 
consistent with the rest of die hierarchy* 
For example, a scripting element's ex¬ 
pression can reference a global macro, 
which can reference any other global 
macros. This was an easy design choice, 
although other alternatives might have 
heen equally interesting. For efficiency, 
you can consider storing global variables 
in a collection class using the C++ Stan¬ 
dard Template Library, 

To prevent reentrance of the code, as 
mighi happen with circular references be¬ 
tween macros, l set an internal Hag inside 
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Figure 1; Class hierarchy and main attributes and services for the circle 
element , 


Ext r aDDXC omit-1 

ExtraDDXl=ELIMNn:;Value;CMacro; ;Macro;Maero Expression 
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Example 2; Adding these lines enables ClassWizard*s support for a CMacro type. 
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the CM aero object before it is resolved 
and the parser checks before attempting 
to resolve the macro, 

CMacro lias an assignment operator. 
Good C++ class design usually mandates 
that you define a constructor, destructor, 
copy constructor, and assignment oper¬ 
ator. 1 wrote CMacro'$ assignment oper¬ 
ator to simplify the code in scripting el¬ 
ements' assignment operators, where 
CMacro types are being copied; see List¬ 
ing Four, Note that the parameter to the 
operator is a reference to an object of the 
base class. Assignment operators are not 
inherited. The default memberwise as¬ 
signment operator supplied by the com¬ 
piler takes an ECCirde reference as a pa¬ 
rameter. Why, then, is this virtual operator 
used when invoked via pointers to EC - 
Circle elements? Because a standard con¬ 
version is available between an ECCirde 
object and an object of the base class, 
CMFElem, The operator is used as die 
default assignment by the compiler. 

Why did 1 need virtual assignment op¬ 
erators for my scripting elements? The ap¬ 
plication uses the elegant llndo/Redo de¬ 
sign pattern described by Jim Beveridge in 
'implementing Multilevel Undo/Redo" 
t DDJ t February 1996). The Undo/Redo 
framework keeps track of user actions by 
saving copies of modified objects. The 


Undo/Redo internals are better isolated 
from die elements implementation by mak¬ 
ing copies of scripting elements using an 
assignment operator on element pointers. 
Tills explains why the operator takes a base 
class reference as its parameter. 

Alternatives 

I did not seriously consider alternatives 
to this method for implementing the 
macro functionality. You can certainly 
design a similar system around a few- 
high level classes, such as a “macro- 
expression manager" to process expres¬ 
sions that are submitted for evaluation. 
This architecture separates the expres¬ 
sion parsing from dialog classes, elimi¬ 
nates the need for a macro type and pre¬ 
serves data members as CString 
(significant benefits when dialogs are cre¬ 
ated at run time or when portability is 
important). The Extra DDX feature is plat¬ 
form dependent and tool specific, How¬ 
ever, because my application features 
over 200 dialog classes, relying on Class- 
Wizard’s support for the CMacro data 
type and the transparency of serialization 
and assignment operators was a time-sav¬ 
ing strategy in its development. 

I did not illustrate how to use a custom 
DDV function, primarily because the ex¬ 
pression’s validation occurs in the DDX 



Figure 2: The circle element has 
center and radius attributes which are 
associated with three edit controls on 
the dialog resource, 

routine. This shows that DDV routines are 
optional, although a DDX routine is nec¬ 
essary if you wish to use a DDV function. 

Conclusion 

A iite” version of my application is avail¬ 
able electronically. It features only a few 
scripting elements. The grammar for die 
parser is included, and the Undo/Redo 
framework is functional. Credit goes to 
Thor Mirchandani and Jim Beveridge. 1 
would appreciate hearing alx>ut die ways 
you have benefited from this article. Any¬ 
one have a self-translating CEdit field? 

DDJ 
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Y2K COMPRESSION 


TO WS-CENTURY 


Listing One 

/* routine: sliding.window 

* Author: Robert Moore 

* Takes a reference four digit year {year), a window width above the 

* reference year tupperwinwith)„ and a one or two-digit year being queried 

* (qyMr). arid returns a four-digit representation {ryear) of the query 

4 year based on the window and the reference year if the query year ie one 

* or two digits, Qu&ry years greater than two digits are just returned. 
Setting reference yeer to a conetant would make this a fised window- 

int eliding window(int year, int upperuinvidth, int qyear) 

E 

int yeardate, centurydate, ryear, temp; 
ryear = -1: /• statue that will b* returned on arret */ 

if {qyear < 100) /* the query year is a two digit year — 

convert to four */ 

C /a bounds check */ 

if {(year 5= 0) if. {0 < upperwinwidth) 6S (a 4= qyear}) 
f /* everything a proper bound */ 
yeardate = year % 100: convert year to a year */ 

ceUtUrydate = year - yeardate; /* and cautury +/ 
temp = yeardate * upperwinwidth; /* calculate upper window limit */ 
if (temp < 1M) 

[ 

if (teinp < a qyear } 

ryaar = centurydate - 100 +■ qyear; /* year 'wraps to previquo */ 
else /* century: case lb */ 

ryear = centurydate + qyear: /“ same century: case la nr lc+/ 

) 

else 

t 

if (qyaar < {temp - 100)} 

ryaar * centurydate' + S00 + i^yeat: /* year in next century: caae 2c */ 
else 

ryear = centurydate + qyear; /* same cautury: case 2a or 2b */ 

3 

} 

} 

else 

ryear * qyear; /* three or more digit year^-don"t convert */ 
return ryear: 


Listing Two 

/* Authors; B, Greg Foley, Rnbert L. Moore 

/+ Description: Thia function determinee if a four digit year is a leap 
/* year, A year ie * leap ycat if it is divisible by 4 but not by 1W, 
/* except years that are divisible by 400. 

/* Parameter Description: 

/* int Year - a four-digit year 
/* returns 1 if the year io a Leap Year 
/* 0 if the yaar ie NOT s Leap Year 

/* Notes: For clarity of the algorithm no arret checking is included. 

int LEAFYEAR(int Year) 

( 

if ((Year U = Hi Year * 100 1= 0) \\ Year % 400 ™ 0) 
return 1; /* This is a LEAP year *■/ 

else 

return 0; /* This ie NOT a LEAP year */ 


Y2K SOLUTIONS 


listing One 

01 BILLCOM-RECORD. 


05 

B-EPF-DATE, 

10 B-EFF-YY 

PIC 

9(02) 

VALUE 0, 

05 

B-COWV-DATE. 

10 B-COHV-TY 

PIC 

9(02) 

VALUE 0. 

05 

FILLER 

PIC 

9(96). 


listing Two 

01 BILLDQff-KEGDED. 

05 S-EFF-DATE, 

10 B-EET-YY 

PIC 

9(04) 

VALUE 0. 

05 

B-COWV-OATE, 

10 B-DONV-YY 

PIC 

9(04) 

VALUE 0. 

05 

FEIGNER 

PIC 

9(96}, 



W3-PTVOT-YY 
MOVE ‘19’ 

ELSE 

HOVE r 20' TO WS-CENTURY 

END-IP 

END-EVALUATE 

END-PERFORM, 


Listing Four 

VALID ATE ■-DATE-ROUTINE. (Performed by the Windowing routine) 
EVALUATE TRUE 

WHEN WET-VALI DATE - DATE EQUAL ZEROS-YYMMDD 

MOVE ZEROS-CC TO VS-VALIDATE-DATE-CENTURY 
SET WINDOW-INDICATOR TO FALSE 
WHEN WE-VALIDATE-DATE EQUAL NIMES-YYMMDD 

MOVE MINES-CC TO VS-VAL1 DATE-DATE- GENTURY 
SET WINDOW-INDICATOR TO FALSE 
END-EVALUATE. 


Listing Five 

01 WS-DATE, 

05 WS-YY PIC XX, 

05 WS-MH PIC XX. 

05 US HDD PIC XX. 

Bi uq.TrtTT 

W^YY-Yi* pic M). 

IF WS-YY f 75 

PERFOEUt 1000-PRT-RTN, 


Listing Six 

WS-YY < 75 
If WSYY-Y2K < 1975 
PERFORM 1000-PRT-ETN. 


Listing Seven 

4 K*+****i ■ tie •+# * a** ■ a** *+■*■« e ■ *+* • *** *** • 4=*=t * * 

* ORIGINAL LINE OF CODE: 

* IF B-CLH-5TU5-CD = ■Q' AND CLM-CLG-REOP-DT-ND (STATS? 

* < TEN-LS&-DT- DftCO NEXT SENTENCE ELSE 

*** Y2K START 44* 

IF B-CUJ-SIUS-GD - '0 H 

MOVE CLM-CLG -REQP-DT-HD (STATS) TO WE - JULIAN-WORE-DT-Y 2 It 
PERFORM JULIAN'DATE-RTN-Y2K THRU JULIAN-DATa-RTN-Y2X*EXlT 
MOVE WS - JULIAN-WGRK-DT-Y2K TO CLH-CLG-EE0P-DT-HD-Y2K 
MOVE TRN-ISS-DT-DRUO TO WS-JULIAN-W0RK-DT-Y2K 
PERFORM JULIAN-DATE-RTN-Y2K THRU JULIAN-DATE-RTN-Y2R-EXIT 
MOVE WE-JULIAN-WORK-DT-Y2K TO TON-ISS-DT-DRCQ"T2K- 
♦ note the period 

IF B-OM-STUS-CD « 'O' AND CU<-OG-RJEOF-BTHD-Y2K 
< TRN-ISS-DT-URDO-Y2X NEXT SENTENCE ELSE 
«* Y2K END it* 


HDF 


Listing One 

f include "hdirh" 

I define WIDTH 5 

fdefine HEIGHT 6 

main{int atgc, chat »argvLJ) 

( 

uinta palatta_dau[7bS]: 
into i; 

/* Initialise the Image array *f 
atatic uintS raater.data[HEIGHT)[WIDTH] = 

E 1, 2. 3, 4, 5. 

6. 7. H. 9, 10. 

11. 12. 13. 14. 15. 

16. 17. 10. 19, 20, 

21. 22. 23, 2k. 25. 

26. 27. 20. 29. 30 ): 

/* Initialize the palette to Standard linear grayscale */ 

for (1=0: i<256; i+4) t 

palette^dataB*3l - i; 

palotto-datal i*3n] = i: 

palette.data{i+3 + 2] =» i; 

} 

/* Aaaociate the palette with the image *f 
EPRSaetpalette(palette.data): 

/* Write the E-bit raster image to the file */ 
DFE&addimage("eiemple.hdf™, raater.data, WIDTH, HEIGHT, 0); 


Listing Three 

WORUING STORAGE SECTION. 


01 POLICY-STAKT-DATE. 

05 POLICY-START-DATE-YY 
05 POLICY-START- DATE -MM 
05 TOLICY-SrAFT-DATE-HD 


PIC 9(02) VALUE EEROS, 
PIC 9(02) VALUE ZEROS. 
PIC 9(02) VALUE ZEROS, 


Listing Iwo 

0include "bdf.h 1 ' 

0include "Rfhdf.h" 
0define LENGTH 3 
0dsfine HEIGHT 2 
Vdefine WIDTH 5 


01 WS-CONVERT-DATE. 
05 WS-CENTURY 
05 W5-D0NVEET-YY 
05 W5-QQWVERT-W1 
05 V5-OONVERT-DD 


PIC 9(02) VALUE ZEROS. 

PIC 9(02) VALUE ZEROS 
PEC 9(02) VALUE ZEROS 
PIC 9(02) VALUE ZEROS. 


01 WS -FI YQT-DAXE, 

05 VS-PIVOT-YY PIC 9(02) VALUE 30. 


PROCEDURE DIVISION. 


MOVE FOLICY-START-DATE-YY TD VS-CONVERT"DATE-YY 
MOVE POLICY-START-OATE-MM TO VS-CONVERT-DATE-MM 
MOVE POL ICY-START-DATE-DD TO WS-CONVERT-DATE-DD 
PERFORM VALIMTE-DATE-ROUTINE 
EVALUATE TRUE 

WHEN WINDOW-YES 

IF WS-CDWVERT-YY IS GREATER THAN DR EQUAL TQ 


main(let arge, char *argv[l) 

( 

/• Initialize the image array */ 

Static flcatCA acien.data(LENGTH] fHEIGHTJlWTDTH] = 

E i.. 2.. 3., 4,. 5.. 

6-. 7,, a.. 9,. 10.. 

11.. 12,, 13,, 14,. 15,, 

16.. 17., IB., 19,. 20.. 

11.. 22,. 23., 24,. 25,. 

26.. 27., 20., 29,. 30. ); 

int32 dims[3] = (LENGTH. HEIGHT, WnmJl: 

int16 acaleD[LENGTH] = [2, 4, 6); 

int32 acalel [HEIGHT] ^ (1234567. 2345676): 

floatSi acale2[WIDTH] = (2.2, 4,4, 6.6, 0.0, 11.0); 

float64 avg = 15,fl: 

int32 start[3] = (0, 0, 03; 

int32 fid. adid, dimid0. dimidl. dimid2: 
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(continued from page 109) 

/* Open file M<f initialise SD interface *f 
fid = SDstart["EiflmpLe.hrff h . DPACCCRSATBI; 

/* Create named data set */ 

sdid = 5Dcreate(fid, Sample data Set 1 *. DFBT.FLOATfiA, 3, dims): 
f* Set up ditsenfliDai iero p/ 
dimidB - SDgetdimid(sdid, *); 

EDeetdimname (dimidfi. "Dimension fl 11 }: 

SDcetdimEi re (diraidtf. "The soroth. ii iaenaion*. W, "2d"): 
SDsetdimsc ale (diaid# r LENGTH. DFNT IWT16. {VOIDP } seal*&h 
/* Set up dimension one */ 
dimid 1 = SDgetdimidfedid. 1); 

SHaatdtmnaiimidtfflidi. "Pisans ion l"h 

SDsetdimstra(dimid1, ’'The first dimension", "cm", "Bd"); 

SPsetdimscale( dimid 1. height. nFjrr_jwrw. (vorOPJaemleL}; 

/* Set up dimension two */ 
dimit!2 = SDgatd{jiid(sdid. 2)j 
SDsetdimname(dimid 2 , "Bimerteioii 2“>; 

SDaetdimetra{dirndl, "The second dimension" r "m", ■ H A. 1 f; 

SDset dimEcale(dimid2, WIDTH, DFNTJLOAT32. (V0IDHse*le2); 

/• Write the date array to the data eet ■/ 

SDwritcd&tatsdid. atarr. NOLL. dims, (char *)seJtfi_daia): 

/* Add one Local attribute */ 

SDsetittrCwIld, "Average"., DEOT.PLOATCA, I T (dmr *)&A¥g); 

/* Add one global attribute */ 

SDMtBttrtfld, ’‘Dst* 41 . DFOT-CHARfi, 9. *< (0/29/53"); 

(* Close the date eet, file, and interface ■*/ 
SDandaecossfadid): 

EDendffid); 


CC^bBOueryProp * sortField(); 
CCeDBQueryProp * operator LI(int fieldJte); 

private z 

bool wtebField(CCeDBfrcp*. int): 

GCeDM&eryProp ** .fields: 

GCeDBQueryFrojj * .sortField; 

int .numFielda: 
int .moxFields; 

WORD .index; 
bool .rang#: 


Listing Two 

elasa CCeDBQueryFrop ; public CCeDBFrop 
[ 

public; 

chum FioldOperation ( 

Equal, LE, GE 

void setOperationfFieldOperation); 
FieldOperation getOperation£): 
private: 

FieldOperation .operation: 


Listing Three 

fintlude Catting.h> 
line lode "hdf^h 11 
I include "tnfhdf.h" 

Kdafina KAXRANK 1 

Idefine LENGTH 3 

IIdefine HEIGHT 2 

Hdefine WIDTH 5 

Ndefine DATES1ZE 9 

meinHint eege, cher *argvIU 

( 

floitM acien.dat*[LENGTH! [HETG3TT1 [WIDTH]: 

intll dimelMAXEANKl; 

int 16 scaled[LENGTH] : 

intl2 scale L[BEIGHT]; 

float! 2 aeale2 tirrom]: 

floatbA svg; 

int 3 2 start[MAKRAHKj = (0. 0, 0): 
int32 fid, edid. dimid: 

int32 i. index, rant, naltra, ndataeets, nglobals; 
int32 nt. count, status; 
into sica; 

cher name [Btil. date [80j: 

/* Open file end initialise SD interfeeo */ 
fid = SDatart (" example, bdf”, DFADC .RDONLY); 
statue *■ SPfileinfotfid, Andatasets, AnglobaJa): 

/• Heed global attribute 
if (nglobale «- 1) [ 

etfttue = SDmrinfhtfid, 8, name. Ant. Anise): 
if {iattetrp{name, "Dmt*")) At (nt == DFffT.CHARB) AA 
fait* = DATESIZZn 
SOreedsttrHid, 0. date): 

] 

/* Open firet data Bet */ 

index = SDnsmetoindex[fid, "Sample Date Set"); 
edid > SDselectCfid. index): 

SDgetinfo(sdid, name, Srsnk. dfmfl. Ant, JAnattrfl); 

/* Head in data if ouerythiug looha okay */ 

if ((rank = HASHME] hh [dixifl[0j “ LENGTH) hi. {dima[I] = HEIGHT) 
6* (dtma [2] = WTDTH) it. [nt = DFWT.FLOATM)) 

SDteeddsta(edid. start, HULL, dime, dcieit.data); 

/" Read Local attribute */ 

etattia ■ SDettrinfo (edid, '0. nemr. Am , Aeite); 
if {(atremptname, "Average M )) AA (nt = DFNT.FUJAT64) A& 

(size - l)> 

EDteadattr{edid. 0, Aavg): 

/* Road dlnaneiona */ 
dimid = EDgetdimid{edid, 0): 

SDdi»info{di*id. name, (rcount, Ant. Anattre): 
if ((nt DFWT.LHTi'fe) hh {count == LENGTH)) 

SDgetdimaexlo{dimid. ecaleO): 
dimid * EDgetdinid(edid, 1); 

SDdiminfo{dim id, name. Acount, Ant. Ahettrs): 
if ((nt = DENT LNT3Z) hi (count HEIGHT)) 

SDgetdimacale(dimid, ecalel )e 
dimid ■ SDgetdimid(edid, 2); 

SDdiminf a (dimid, name. Acaunt. Ant, Ansttre) ; 
if t(m - PFNT.FLXIAT32) «A (count — WIDTH)) 

SDgetdimscala(dimid, ecalel); 

/* Cloae the data ml, fila. and interface */ 

SDendacceas(adid )e 
SD ondffid): 


WINDOWS CE 


Listing One 

class CEQoery 

I 

friend class CCeFindHBDetabeselteTater; 

public: 

CEQue ryO; 
virtual “CEtjueryf); 

void addField(CCflD&DueryFrop *): 
void JetSortField{CCeDB<iucryFrop *): 
int numFielda(void): 

CCeDBQueryFrop * teatDnfl/DHD. WDEIl): 
bool natch(CCaDBSccord*. bool A. int A): 


Listing Three 

clnsa CCaFindPRDatabaae t public CCADADatabeae 
{ 

public: 

CCeFxndDBDatabaeelterater * Find (GEQusry A )E 
bool TejtFind {GBiQuary A ); 
private: 

CCaDRQumryFrop * FindXcylCBQtierv A): 
bool LeihdeXed[CEFRUP LH, WORD *)! 


Listing Four 

class CCefiudDHDatabaSeItSrater 

t 

public: 

OCeFindnflDarabflaeltfl rater (CEDuery *, CCbHBOuaryFr op *. CC-eDBDatabage *)t 
_ vCcPindDhDat abanclto ra tat(): 

CCeDBKecerd 1 current(); 

GCoPbRscord * first 0: // rotors asatch 

OCeDBRecord ■ oext ()e 
void Htatsdnt A. Int A. bool &)j 
private r 

bool -Jtart: 
bool _.eof: 
kit .hits: 
int .teste e 

CCeDBQueryFrOp * .primatyQliery; 

CCeCRDatabaea » .myDbE 
CEDuery * .myGuery j 
WORD .IteyOrder: 


Listing Five 

void CExupl eDl g: rOnlnit () 

C 

CCeDEDatsbase neufIB; 

CCoDSJProp sort a [2] i 

aorta[0]-5*tTypo(eCsDBProp::Typo_Strkif): 
sorts {02 .SetldentU); 

aorta [0] .SetSoirTFlagj(OIJeDB,Prop e eBo rt_Ascending ) e 
sorts(ij.SetType(CGsDBFrup::Type.Long): 
sorts[1].SetIdent(2): 

aorta 111.SatSortFlsga(OOaDbFrop:;£ort.D^acanding >; 

if (cwwDR.Croat*(TEXT(' , JCSExttfflpU' , ) f 123, 2. sorts) ™ NULL) 

t 

rwvDB. Op«n<TOT( "JCSExampl*")): 
oevDB.Delete{); 

eiowDB,C roat*(TE(T{"JCSE**fflple"'), 123, 2, sorts); 

1 

nawDR-Open <TEXT [*JCEExamp1*")); 

CCeDBRacerd r; 
r.AddFrup(Aaortfl[0)); 
r,AddFrop{Aporta[l]): 

r.AddFtop(new CCeDBFro^(GCeDBFrOp:tTjpe.Filetime, 3))J 
far (ini i = 0: i < 1000; i«) 

{ 

TCHAR atring.field[256|s 
FILETIHE filctiae.field: 

SrSTEHTIHE syatemTimeE 
int J; 

// generate three random values for fields 
CCoCBProp* p ■ r,GetPropFtonrd*nt{l); 
string,_field [0] = (TCHAF.) {' A 1 +Rand l» ()V26); 
for (J*l; j < Unt)(5 + (fLamiw»a*10)); j«) 
string.field[j] = (TCHAR)( 1 s 1 +Rendora() %26); 
string_fi*ld[j] * 0; 

p->SotString(string.field]: 
p = r.GetFrupFrD«ldent(2); 
p->S*tLong( Random£)): 
p = t.GetFropFrD*IdEot(3)J 

m*bs*i (AaystemTiime. 0. s iceo f (syat neTiae)); 
sysremTime.vfear = (unsigned short) (1500 ♦ Random01200); 
syatamTime.vMohth - (imsigned short) (1 + fandom()i 12): 
aysteaTiM.vDay = (unsigned abort) (1 + Random0^28); 

SystewTimeToEilcTims(AsyatemTime, Lfilstime.field); 
p->5etJiletif( filetime, field); 
uowDBrAddRscordf&r); 

) 

3 
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Listing Six 

CC«DflftueryFroj?;:PisldOperntIon .opfltJ - f 

OCeDBQueryP rop :l LE, CCeDBQue ryProp::GE. OCeDBGueryPr&p;;Equs1J: 
GCflDBQufrryPt'Oip * q.p; 

CEQue ry q • 

if {n_fllvil J* -i ii Ia_af ll.IfiBaptyO) 

t 

q.p * fsc* COBBQuciryPropt,); 
qp->SetTypa(GCeDB,Pr£jp ■:Type_£tring); 
qp->SetEdenc(l)[ 

! 1 P “ > Set St ting ((TCHAR •) (LecrSTR)m„sfllj t 
q p-> setOperation(_opsJlLSllval]}; 
q.addfield(qj?); 

3 

// NOTE; other query properties omitted t bee listing 

CCeFindDBDaiabaee xheDB: 

theD3. Open {TEXT (" J U5Exati$> le '"J ); 

CCeFindDBDatflbaEeltetetar ■results = theOE.Find(q): 

TRACES("\nREanlts:Vn”)i 

for (CCsBBRecord *r * (results-> first (); r; c ■ results 
E 

CCelWrap* atrfJeld * r-X^tFropFtomldofitd); 

CCeDBFrop* intfield = r->GetPropFroinIdent (.2); 

CCeEBPrnp* dstefield ■ r->G«tPropFrc!«Ident(3) ; 

FlLETIHE £ * {datefield-XJetFilet ine O 31 
SYSTEffTIKE %: 

FilETimeToSyeteliTime £frf „ Get}; 

TCBM timeStr [2^6] : 

GetlateFo rmat (LOCALE. S¥STEH_DEFA(JLT, DATE, LCWGDATE,it.B r tine St c,25fih 
TRACE3 {"\T3a\t%d\tSa\n" r strfield->Get3trijigO * 
int f del d ->G«Leng (). tim*£trJ; 
delete r; 

1 

int hits, teats; 
bool range: 

re suite-> state (hits, tests, range}; 

TRACERf"State\n\matches; iSdVn'Stmdssee: M\n\tTottl: fcd\n". 

hits* tEsts, kits+teatsJ; 

TRACEK "Range search *a usedW". tenge T TEXT("was") : TEXT("was not")): 


ACTIVE DATA OBJECTS 


Listing One 

<i— UEEKLQGLN.HTK “> 

<HTKL> 

CHMO 

<TrrLE>Applicatinn Lngin</TITLE> 

</KEAB> 

«EDDY BGCOLORS «FFFFFF"> 

Blesse login with your user name and psasMord, 

iFDEM MAHE= , 'LoginFom" METHOD "FQST" AGTIOffT=" login . asp" > 


<DJFDT HAHE="UaerName ,h TTFE="TEXT" S3ZE=25* 

£INPUT WAlffl= "liter Pud N T¥fE“"TEKT h ' SKE-2S? 

{INPUT T¥FE="RESET" VALUE="Clear ™ > 

< INPUT TTPB"“SEJBMIT" VALUE*"Log ;n"> 

</fdrm> 

t/FtTML> 

<t— Loom, ASP — > 

LANCDAGE= ,t VSBCRIDT 1 "* > 

i% 

Response. Buffer * TRUE 

I 44**4**44**44**444*4***4***4***4*****4*44*4**4******4*4*4************ 

1 This script handles the user login 

r 4 4 * *4*444 **4 * 4**4 * *4 * 4*4 * 4*4 * *4 4 4 *44 * **44•4**4 **44**44 *************** 

UserNaue = Hequeat.ForEE 1 'DserKame"} 

UserPvd * R*quost,FonB( lf Ufl«rPwd") 
if UserName = "" or UserFvd t* '" TT then 
Reeponea r Red i reel ( 11 userlcgi n, Jvrs" 1 
end if 

cADOCormec tSt r ing = 11 d sn=Eaample _dh r Uid=ua e r r jH.d=uee rpwd u 
set Conn = Server .CresteOb]>ct ("ABODE,Connect ion lh ) 

Cotm. Open f eAMConneC t St ring) 

Query = "SELECT UserlD from tbl„UBer vhere (rUBerMBme = # UserHsme 
E ") and (DserPud = " & UeerPwd 4 ■))* 

Sat RS - Conn.Eaetute(Query) 
if not RS.EOF then 
Userid » .^("UserlB 1 *) 
else 

Response, Rfldirect f*user login, huB' 1 ) 
end if 

REdirectURL - "welcome. aep?ae£aitey=’ , 

{Juery = "SELECT SceeioctlB from thl_ Sosa ion where User ID ■ " _ 

& CStr (tfserld) 

If nut RS.EfJF then 

RedirEctdRL - RedirectURL 4 RS("SesaionlD") 
alas 

DateTime > Date (1 " " 6 Time 

Qnery = "INSERT tbl Session valnesi" 4 CStr{Userid} A ", Mr _ 

A DattTina £ SELECT «#IDHJTTT¥" 

Conn,Execute(Query) 
set RE 5 RS,NoxtRecordSei 
RedirectURL = RedirectURL 4 RE{0} 
end if 

Response,Redirect{RedirectURL) 

*> 


Listing Two 

<|— WELCOME. ASP —> 


(continued on page 112) 


Every direction carries a solution 


SAME COMPILERS FOR UNIX 

WINDOWS NT 

* Secure path for future on Intel 

* I A-32 versions today 

* IA-64 at first system ship 

FORTRAN 90 

* F90 Standard Conformance 

* HP, DEC, Cray & Microsoft Extensions 

* Super-scalar & Parallel Optimizer 

* Excellent run-time diagnostics 

C++ 

* ANSI Draft Standard 

* C++ targeted optimizations 

* Standard + Rogue Wave Libraries 

* Early dialects + ANSI C 

UNIX_ 

* Field proven for Intel, PowerPC, 

Sparc, Mips 

Third party marks and brands are the property of their respective owners 

CONTACT EPC FOR FURTHER INFORMATION: 



WINDOWS NT 


WiMDOWS 

NT 



USA 

International 


tel: (217) 398-5787 
tel: +44 (0) 131 225 6262 


e-mail info@epcxom 
e-mail info@epc,co,uk 


http://www.epc, com 
http://www.epc.co.iik. 























Putting You In Control... 

SftBox/DLL 

Combo box with 
multiple columns, 
headers, column 
drag & drop, virtual 
data, and much 
more.,. 


JE* Ken* DoetiToeh 136. ! 1 Jt Fknn Flrts flam 

'B'wm usTd dc ' 


| THeECTLIiy 
I TREEX HC 


TrdHira. Flam 223 

ivflsstv 

5««4e Ptoaian, 

|@ PmtPQ 
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SftTree/DLL 

SftTree/OCX 

Single and multiple 
selection tree 
control, multiple 
columns, cell editing, 
headers and more 
grid-like features. 



SftTabs/DLL 

Add tabbed windows and 
tabbed dialogs to all your 
applications. Many tab styles 
and advanced features. 

These royalty-free controls 
support Windows 3J, NT end 95 
In 16 and 32-blt applications. 


MoifW] (941) 505-8600 

Not The Industry Standard FAX (941 ) SOS-8555 


316 Tamiami Trail, Suite 14 - Punta Gorda, FL 33950 


ca" 


www.softelvdm.com 


PCYACC Version 7.5 


PROFESSIONAL LANGUAGE DEVELOPMENT TOOLKIT 

Includes 'Drop In" Language Engines for SQL, dBASE,, POSTSCRIPT 
HYPERTALK, SMALLTALK-BO, C++. C r PASCAL r PROLOG , FORTRAN , 
COBOL BASIC. SGML ASN. RPG , REXX r PL f, SNA. RTF ■ VISUAL 
BASIC , SQL2 r DB2 , VHDL , HTML, VMRL , JAVA, ODMGODUOOL, SQL3 r 
MODULA-3 , DELPHI, V8S . and ADA. 

PCYACC Version 7.5 is a complete Language Development Environmeni that 
generates C. C++. Java, Delphi, Visual Basic, and VB$ source code from input 
Language Description Grammars for building Assemblers, Comptiera, Interpreters, 
Browsers, Page Description Languages, Language Translators, Syntax 
Directed Editors, Language Validators, Natural Language Processors, Expert 
System Shells, and Query languages 

■ Portable Object Oriented Classes for C++ and JAVA - Error, Symbol 
Table, Syntax Tree, Yacc, and Lex. 

■ Debugging tools include runtime Visual Parser Debugging, Abstract 
Syntax Tree generation, and Cross Referencing. 


CodeCheck Version 7.5 


SOURCE CODE ANALYST 

Includes "Drop-in " Rules for Compliance analysis. Adherence to 
specifications. Measures of complexity. Silent error detection , Code 
maintainability, and Portability 

CodeChecN Version 7,5 is a programmable tool for managing ali C and C++ 
source code on a file or project basis CodeCheck is input compatible with ail 
variants ol K&R, ANSI C and C++ CodeCheck is designed to sotve all of your 
Portability. Maintainability, Complexity, Reusability, Quality Assurance, Style 
Analysis. Li brary/Class Management, Code Review. Software Metric, Standards 
Adherence, and Corporate Compliance Problems. 

■ Compliance - CodeCheck allows your corporate coding and project 
specification standards to be completely automated for compliance 
validation. 


CodeCheck includes pre-written expert system Rule Files that can be applied 
to any C or C++ project. 

30 day Money back guarantee! Free 
AIR Shipping anywhere in the woridl 


CodeCheck 

PCYACC 

DOS/WIN 16 

$495 

$495 

MAC 

$495 

$495 

OS/2 

$995 

$995 

WIN32 95/NT 

$995 

$995 

UNIX/VMS 

$1995 

$1995 


tel 


www.abxsoft.com 

ABRAXAS' 

Software,Inc. 


5530 SW Kelly fen.. Portland, OR 57201 USA 
TEL {503) 244-5253 - FAX (533) 244-0375 
E Mail sataseabxsan.com 

To Order CaJI 1 "800-347-521 4 


(continued from page 111) 

<W LjUfCUACE="VBSCRIFr , *> 

(\ 

Response.Suffer = TRUE 

■ ** * ♦************************ * «** **4* ******************** t »«**** M■M * 

' This page welcomes tie authenticated user 


if Request.QueryStringt"seeskey") = theft 
Respond e. Redirect ( n userlngia. ht» p ]' 
end if 

cADOCormec tString ■ " d fln»«tsjnple_dh ; ujd=ua a r: pvd=use rp«d" 
set Conn = Server,CreateObject("ADODfi.Connection^) 

Cor.ti. Open {cAJXfContifrt eSi ring) 

{Juery ■ I *SEL1CT JJterlD from tbl-Session where SesalonlD = Hi _ 

S Request.QuerySt ring( n seeskey”) 
eat RS ■ Conn, Execute{Query) 
if net RS.EOF then 
Userid * »S("UserUH 

Query = "SELECT UeerFullName ft« tbl_U*er where Us«rD ■ " _ 
k CStr(Userid) 

eet RS => Conn. Execute (Query) 
if JES.EGR then 

Response. Write {"Bed user IQ in W*1 COM. Up* 1 ) 

Response-End 
else 

GMrFullN*« = RS ("TJaerFullName"] 

CateTime = Date h + " k Ti*e 

Query “ "UPDATE tbl_5eaaion set LastAccesaTiae = 11 S DateTime _ 
4 " where SedeionID • '* 4 Request .(hse^ySt^inaf''sftBflk«y ,l ) 

Conn.Execute(Query) 
end if 
eiee 

Retponae,Redirect["userLog)n,htm") 
end if 
%> 

CHTXL> 

<HKAD> 

tTITLE>Welcome Page (/TITLE) 
c/HEAQ> 

(EDDY BGCOLQR="limrPF"> 

Welcome <^=naerFullHaM*>. 

(/BQinf> 

c/trmL> 


HOTVIEWS 


Listing One 


Kv.Valnitielixiifargt, argv. 

flv.HORAINBOV, True, 

tfvJJSMLGGNEVXIlI, True. 

Hv .VERSIONNHHEER, 101, 


Hv. I1SER5CTENDEEQQL0RS, True. 
HULL); 

InitO; 

Bv.QqOi 


listing Two 

void Hu.GetAitr dbut aa(va_list up, 

HvJlttributeArray attributed, 
cher **optliit. 

int ’tags. 

int *fonta, 
int +coli>rx, 

■hert •numnpt) 

l 

im argflng * -u 

int prirvurgf lag: 

/■|We defaults far all attributes */ 

Hv.SetDfrfeultAttrihutee(stt ributes); 

/■ now get the attributes */ 
while Cergflng 1- 9) [ 
prevargflag = argfiag; 

/* the argfUg is the keyword. Udefined as a net of sequential inta auch ee 
1define Hv HACHGROtIHB 21. etc. the NULL teminatct will produce s iero 
HTfifUg */ 

aegf lag = va arg (ap, int); /* ahu-uid he >■ teto k i BCfflfl MX */ 

if ({argOeg t 0) )\ fsrgfUg >= Hv.NUHATTRIBtFFES.)) t 
i* uninter ating error handling aaitted 
) 

else if {srgflsg > 8) C 

/+ group according to type, place in the attribute array which ia sn array of 
unions. We? need insider information about what ryp* is assocUted with each 
attribute- */ 

if ((argflag — Hv_DHAWCONTRCU 
U (HgfUg = Hv.DSERHRAVCCMTRQL) 

/•or Buriy integer attribute! oaitted •/ 

| J [argflag = Hv.RELATIVEPLACEMEHTj) 

it tributes fargf lag] -1 ■ VA-itgUp. int): 
else if ((argflag = Hy.STATE) 

I! Fargflag — Kv-USfiWBLCOHEVIOT) 

/• nr many ahnrt attributea omitted •/ 

IS {arg(lag — Hv_PLACKHENTCAP)) 

attriiutaafargflag).a 3 (short)va„arg(ap, int]3 
/• oalrring coda for other types; chare, atringa, floats, etc,. ■/ 
alae /• only thing left ata the pointers */ 
attribuieeiergfleg],v 3 va_arg(ap. void +): 

J /• end argflag > 0 •/ 

) 

va _end(a p); /* terminate the prDCe&aiftg */ 


Listing Three 

void IhitO t 

Hv_canvaaCoJor = Ev.LigktSeaCreeo: /+Chsnge BQ color •/ 
WindowTiieO; /• tile the window with propaganda »/ 

HakeMenusD : /• modify the main menu */ 

InitControle ()t f* Initialii* controlI */ 

AddLogof): f* add a logo */ 

lnitQuickZoom{); /• initialita quick zooms •/ 
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FImI In.it 0 : /• final initialization •/ 


Listing Four 

void MakaMenus() ( 

Kv-Widgat Item; 
char “tart; 

/• ctaaEa the (fichu ittfflP fur the Help menu *i 

text = {char » , )Hy_Mb11dc( atrlen(Hv_pro£raTriMame> + 20): 
strcpy{tt}(r 1 "About "i: 
atrcatftaxt, Hv .pra-ErsuiNsrae): 
street{test, 

item = Flv VaCr8BteKaEiu3tEn]{Hv l .liBlpHEnti. (tv LABEL, teit, 

Kv_CALLMCK. DoAhoutDLo S . HULL) 

HV,.Free{taxt )3 

/* add e "new view" itea to the action Menu */ 
item ■ tfv VaUreBteKEnLiltEni(Hv_actionHemj, 

Hv_LABEL. "New View", 

Hv_CALLBACK, GetNevViev* 

MLU.] 3 

/* add a plot canvas optltm 10 th« action ranu +/ 

item = Hy VaCreBTeHBnuItemfHv.BctiaiiMenu, Hv LABEL, "New Plot CanvaB 1 ', 
Hv-CALLBACK. NevPLotVtev. HULL}: 

) 


Listing Five 

static void FinallnitO E 
Hv.View view: 

UvlAddPath{"-/ifiep!wlter"}: /* add to search path */ 

Hv_„ Initttapd ("hv.naps" ); /* initialize nape */ 

view * NiwVifivtHAFVTHW); } /* create one initial view*/ 


Listing Six 

Bv.VftCreateView{AView, 

Hv.TAG, tag. 

Hv_ DRAWCQWTROL, 

Hv-STAHUARflDKAWtXlHTROL+Hv-SAVEBACKGKOUM, 
Hv PO-ELfPCONTROL, papupcuntiul, 

Hv .TITLE, tdlt, 

Hv INITIALIZE, YievSetup, 

Hv-G&SItiKIZE. VUvCtutoai». 

Hv USERDRAW, ViEwDraw. 
Hv.OmCTEBWUSBRBAAW, Of f Sc resnV i evD raw, 
Hv FEEDBACK, ViEwFEedback, 

Hv.LEFT. left. 

Hv TCP, top, 

Hv.XHIM, smin, 

Hv.XMAl, max, 

Hy_YMIN, ymin, 

Hv.THAK, jmiHK, 

Hv-HOTRECTHIDTH, width, 

I!vJJDTRJJCTHE1GHT, height, 

Hy MINZOQMlf IDTH r uinw. 


Hv.HAXZOOMtfIDTH. maxw. 

Hv-KIHZOONHHIGHT. »irvh. 

Hv HAXZOOKHEIUHT, uaxh, 

Hv_SUffROC. (Hv-PunctionPtr)Si*iln tionCB. 

Hv 5ItfINTERVAL, 2000, 

Hv-HOTBJSCTVMARGIN. 7* 

Hv HDTRECTCOLQR, Hv blue-2, 
Hv.HOTIOHLESSFB. True. 

Hv MErTEOHLESHFB INTERVAL, 503, 

NULL); 


Listing Seven 

Choice ■ ttv VaCreateltem{View. 

Hv.TYPE, Hv CHOICEEETITEH. 

liv. .RELATIVJiPLAC£K£!HT, Hv. POSITI0NBEL0V, 

Hv PLACEHEHTITEH, Boe2. 

Hv PLAOHKKMTGAP, l, 

Hv_NOMGN, mdata-> p roje.ct.ion, 
Hv.FIUCOLOft, Hv_ pewd 0 rBl ue, 

Hv.AHHCDLGR, Hv. navySlue, 

civ-COLOR. Hv_*rayl0. 

By FONT, Hv fisedZ, 

Hv-QPTIQM, "Mercator". 

Eiv OPTION, "Orthographic", 

Kv_S3NGLECIT CJL. P roj ectionChoiceCB, 

Hv TITLE. "Hap Projection", 

MULL): 


Listing Eight 

Item * Hv,.VaCteate3 tem {View. 

Hv_TYPE. Hv-WORLDFOLYGCNTTEX, 

Hv-TAG, DCITEM, 

Hv_HUMPOIWTS, up, 

Hv.DATA, fpt£. 
flv.NUHROWS, 1, 

Hv.HUKCOLUHNS, l. 

Hv UOKAIH, Hv IMSIDEHOTRECT, 

HvJJOtULECUCK, BdirDClttK, 

Hv DRAHCONTHOL, Hv.ZOOHABLE + Hv_INBACKGROUND, 
Hv_AFTEKCiFF5ET. DCAfterOffset, 

Hv.OSERl, (int)superlayer, 

Hv_EJSER2, {lot) a act, 
ttv BALLOON, {void ♦JtaEt. 

Kv_F3XSLGI0N. FiiHCSuperLayarRegion, 

MULL): 

Item-7 type = Hv_UEERITEK: /+ redefine as uaer itsa */ 

1 tem"->standarddraw * DrawUCItem: /* redirect Std. dfawlhg */ 


DDJ 
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Make the Clear Choice 

Hassle-free Web-based training. 


dwitalthink.com 


WWW. 


■ Self-paced 
• Tutor Support 
• Up-to-date Content 
No Installation Required 


"This method of delivering training is a great 
way of getting OUT employees hands-on 
education in new technologies. Implementation 
was quick: our workers could sign up on 
Monday and be going through the coursework 
on Tuesday” 

- Brad Clayton 

Training Manager, Ctrenl Semiconductor 

With Digital Think training, you don't suffer 
travel or meeting room costs. You never get 
outdated training. And you don't struggle with 
support issues - unlike CD-ROMs* there are 
no special software installations or hardware 
requirements. All a student needs is a Web 
browser and an Internet or Intranet connection. 

Far a course evaluation and more information on our 
corporate programs, contact am Sales Department 
at sales@digltalthink.com or call 415.437.2SOO. 


DigitalThink 


LEARN WHAT TOil WANT 


www.digitalthink.com 


©1990 DigitalThink, Alt rights reserved. 

















'Windows/CVI 


Virtual Instrumentation Tools tor C/C++ 


E#e Operator Loun ^onfiautaficn insuunjerts flepofl. Log 


Timebase 


Running. 


Call today for your 
FREE LabWlndows/CVI 
Evaluation Version 


LobWcfidam/CVI Tetf Execution System 


Build on a 


Rock Solid 
Test 


i, in 


I iiture 


When you need to build a BIG test system. 
When you can’t afford to get locked in... 
When it has to be fast... 

When you need a solution that works... 
When your production line depends on it... 
When your job is on the line... 


Why look anywhere else? 

✓ Standard ANSI C Programming 

✓ Lightning-fast code generation 

✓ Open development environment 

✓ VX\p!ug&play driver standard 

For data acquisition, GPIB, and VXI 
instrument control, analysis, and user 
interface tools, LabWindows/CVI has 
everything you need under one roof. 


Put LabWindows/CVI to Your Test! 
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Execution Cortfoi j Debu^i j Logging j Setup 
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Loop,bp T j 


Break 
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Powei Cn 
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Instiumeni Check 
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Linwrily Te-sts 

PASS 


G«n Contrd Tests 

PASS 


Frequency Response 

PASS 


Ana^zer Sweep 

PASS 


AmpStude Lnearity 

PASS 


Power Down 

PASS 


Ftesdts Log 
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WT NATIONAL 
I* INSTRUMENTS 

r The Software is the Instrument ,h 

U.S. Corporate Headquarters 

Tel: (512) 794-0100 • Fax: (512) 794^411 
info@natinstconi • www.natinst.com 

Worldwide network of direct offices and distributors. 


K Copyright 1997 Instruments Corporation. AH rlgMs nS raved, 

and company names listed ara tradsmarlKs or trade rtamBE of Ihalr respective 
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PROGRAMMING PARADIGMS 

A Chat with Bob Berner 


Michael Swaine 



A s the millennium looms, a comput¬ 
er pioneer has come out of retire¬ 
ment to create a product he claims 
will solve the Year 2000 problem. 
And he claims it will do so an order of 
magnitude faster than other approaches. 

This veteran programmer knows all 
about the Year 2000 problem, because he 
helped create the Gobd language in which 
much of the problem code was written, 
he created many of the standards for data 
encoding that have existed since the ear¬ 
ly days of computing, and he wrote the 
Datamation article back in the 1970s that 
alerted many to the two-digit date prob¬ 
lem that others are just waking up to now. 

Bob Berner is a programmer from the 
generation of legends: He worked with 
Fortran creator John Backus and COBOL 
creator Grace Murray Hopper and IBM 360 
designer Fred Brooks. He was enjoying a 
well-eamed retirement when lie hit upon 
his idea for solving the Year 2000 prob¬ 
lem. While another senior citizen might 
have been satisfied to pass the idea on to 
a younger programmer, Bob moved to 
Texas, set up a company, developed a soft¬ 
ware product, and started talking to the 
press. 1 recently talked with Bob about his 
past contributions to computer program¬ 
ming and lus proposed solution to the Y2K 
problem. Here’s what he had to say. 

Don't ASCII, Don't Telly 

DDJ: When I think about the standards you 
are responsible for and the projects you 
have worked on in your career, it seems 
that the name “Bob Berner" ought to be a 
household word among programmers. But 
[ suspect that’s not the case. Maybe after 
tliis interview, that will change a little. 

You’re known, among those who have 
heard of you, as the inventor of ASCII, the 
inventor of the escape sequence, the guy 
who named Cobd, a pioneer in word pro¬ 
cessing and time sharing and internation¬ 
al standards for data processing, I want to 
hear about your Year 2000 solution, but 
Fd also like to know' how you came to in¬ 
vent ASCII. 


Michael is DDJ s editor-ah large and can 
he contacted at fmwaine@sivaine.com. 
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BB: I made a survey in I960 [while work¬ 
ing for IBM] and found out there were 
over 60 different ways the alphabet was 
coded for various computers. So I started 
to pick out the problem of interchanging 
files, and I started making proposals for 
a single code. Before that I did the char¬ 
acter set for the Stretch machine at Los 
Alamos. That was the first eight-bit-byte 
computer that I know of. But I made a 
mistake. I pul the alphabet in as capital 
A, lowercase a, capital B, lowercase b. And 
that was stupid. It was then dial I wound 
up with the escape sequence idea that 1 
published sometime in i960. 

DDJ: But ASCII became more than an IBM 
standard. How did the internationalization 
of ASCII come about? 

BB: I was invited to talk to the British Stan¬ 
dards Institution and I got to go to the 
electronic industry association. And final¬ 
ly I was called by two IBM vice presidents. 
They said they would like to revitalize 
what w as then called the OEMI— the Of¬ 
fice Equipment Manufacturer’s Institute— 
and they wanted proposals from me for 
what should be done in the w r ay of com¬ 
puter standards. We had a big meeting, 
the first meeting of X3 under ANSI aus¬ 
pices. And that later became an ISO com¬ 
mittee when we went international with 
the standards for computers. 

Then came the fateful day. We had 
ASCII going. We w p ere about to sign off 
and the 360 w^as about to get out. And 
Freddie Brooks tells me that the printers 
and punches are not ready in ASCII, 

DDJ: They were still designed Ibr EBCDIC? 
BB; Right, And one manager, now r dead 
(but I w^on’t say “God rest his soul”) de¬ 
cided they were going to do both. They 
would pul in p bit, and if the p bit was 
0, the machine would run in EBCDIC, If 
the p bit was 1, it would run in ASCII. 
He thought that was a reasonable w^ay 
to solve die problem, because he had to 
announce the 360 in a hurry. So they 
did. Unfortunately nobody told the pro¬ 
grammers, and they did all their systems 
programming in EBCDIC, As a result, 
they couldn’t make the thing run in 


ASCII. So ASCII originated at IBM, but 
they didn’t follow through with it. Isn’t 
that a crazy story? 

Grace Under Pressure 

DDJ : Okay, I also have to ask, whai was 
your involvement with Cobd? 

BB: I came to IBM in late ’55 to [write] a 
system that allowed you to use the 705 
commercial computers for scientific work. 
11 was a big success, but when I did that, 
I was in the same room as John Backus, 
And I was watching the Fortran work, 
and saying, hell, tills has got to happen. 

So I got hold of [AJ.] Perils and lie al¬ 
lowed us to use his compiler from 
Carnegie Tech. And we chucked that in 
under a thing we called “Foitransity which 
is the first time we had a programming 
language that worked on different com¬ 
puters. 

And in January of 57,1 got to meet Dr. 
Grace Hopper of the Franklin Institute in 
Philadelphia. And I started working on 
commercial translators. And we sort of 
blended all those tilings together and came 
up with CoboL 

Probably it was a good thing in one 
w r ay, but it was a mistake in other ways 
because nol everybody who got in the act 
of programming computers was careful 
to annotate it. People made mistakes. Like 
in computing the year. 

DDJ: What was your main contribution 
to Cobd? 

BB: I guess the major tiling I did for Cobd 
was the picture clause. 

DDJ: Which is? 

BB: The picture clause? That’s w here you 
say, 1 have a piece of data, this is its do¬ 
main, and these are its characteristics. If 
you said 599, that was going to be a two- 
digit number, signed. And we used vari¬ 
ous other symbols to say this Is all alpha¬ 
betic, punctuation, and the like. This is 
the first data typing. If you look at some 
of the later languages like ADA, PIT, ev¬ 
erybody brags about heavy typing, you 
know, very strict. Control programmers 
against their ow r n mistakes. Well, that was 
the first data typing. 
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Visual Parse-H- is the only tool that brings 
cutting edge parsing technology to all 
developer's. You can design any parser 
without ever leaving our sophisticated 
visual development environment. 
Whether youTc new to parsing 
technology, or an experienced veteran. 
Visual Parso++ is the right tool for you. 

Visual Parse-H- provides native pro¬ 
gramming support for C, C++, Java, 
Visual Basic, and Delphi, Our C++ and 
Java classes are fully supported on all 
platforms. Visual Parse++ also comes 
with drop-in parsers for HTML, SQL, 
RTF, Java* C t C++, and more. 

New to version 3,0 is natural language 
parsing, Visual Parse++ is easily the 
most powerful parsing engine in the 
world with generalized parsing 
capability. Visual Parse++ now can 
handle any grammar, including 
ambiguous grammars. This opens up the 
area of natural language parsing, 
among other applications. Also included 
is a bi and new user interface, with many 
new features and improvements, along 
with 3D parse tree and machine views. 
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Call 800-988-9023 
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Phone: ( 
Fax: (6i 



Boh Berner 


DDJ: So you invented data typing? 

BB: I never thought of that. J didn't [actu¬ 
ally] create die picture clause, I [really] cre¬ 
ated data typing. 

Don't Blame the Programmers 

DDJ: All right, let's talk about the Year 
2000 problem. I’ve heard all sorts of esti¬ 
mates for the magnitude of the problem. 
What's yours? 

BB: 1 talked to a Gartner Group guy last 
November, and asked, how's your $600 
billion dollar estimate holding up? And he 
said, real fine. When people start talking 
about it getting into tile trillions, I won t lx 1 
surprised, except Fm going to try to bring 
it down, If the US. got our act together 100 
percent, we'd still be in terrible shape if 
the Asians didn't and the Europeans didn't. 
And line Europeans are trying to do mon¬ 
etary conversion to the Euro and the Year 
2000 |solution] at the same time. 

1 heard from a guy at Micrasoft that they 
moved the dock ahead on the equipment 
up at the Hanna ford atomic energy place 
and they blew all the air conditioning, 
they've gotta get new air conditioning. 

DDJ: That’s scary. 

BB: We know there’s a lot of nuclear stuff 
that’s gone haywire. A couple of months 
back, i talked to the gal who's running 
the Year 2000 program for the State of 
Texas, and they have a nuclear reactor, 
but she admitted she doesn't know what 
they're doing. 

DDJ : The naive question I sometimes 
hear from people is, how hard can it be 
to find references to dates in source code 
and change them? I guess one of the sev¬ 
eral ways that question is naive is that iL 
assumes the source program is even 
available. 

BB: Source code is not around in about 
30 percent of cases, 

DDJ: And in some cases people may have 
the source but it's not the latest version. 
Or maybe die code hasn't been recom¬ 
piled since the last time the compiler was 
revved and might break just as a result of 
recompiling, 

BB: Yup, There are a lot of mismatches 
out there. 


DDJ: And, of course, it’s both the pro¬ 
grams and the data. 

BB: And when you try to stretch it out 
with expansion, you know, put in an ex¬ 
tra 19 and an extra 20, everything gets 
shoved over. And those poor compiled 
programs, particularly the object programs, 
don't know where anydiing is anymore. 

DDJ: So we are in diis mess all because 
a bunch of Cobol programmers wanted 
to save a few bytes of memory? 

BB: No, Everybody blames the program¬ 
mers, but it wasn't their fault. It was peo¬ 
ple [like managers, customers]. They were 
happy they didn't have to use 19s. There 
were international standards, [but people 
wanted to use] month-day-year instead of 
year-month-day, like computers had to 
have it. 

This complicated the problem tremen¬ 
dously. If you write die year first, you don't 
have to go through a lot of fancy compu¬ 
tations. People say that they didn't have 
enough memory, Lhai it was expensive. 
Well, if they’d used the proper order they 
would have had plenty of memory because 
they could have saved all the routines they 
had to have to jockey ii around for com¬ 
pares and to write it on this or that report, 
1 get pretty vehement alx>ut that. As a pro¬ 
grammer, I don’t think we re at fault at all 

As in Bigit Baudot? 

DDJ: You have a solution to the Year 2000 
problem, but your solution is different 
from anything else out there. Apparently 
you don’t think much of approaches like 
date-field expansion and windowing, 

BB: Date-field expansion will never make 
it. They thought so a year and a half ago, 
but now they don't have time. 

It means you actually rewrite die Cobol 
program and say pic999 instead of pic99 
;md if you're lucky you'll find all the tilings 
in the program that are dates. But as a 
practical matter you can't. \ use the ex¬ 
ample of an insurance policy number 
where they've used the date of issue as 
part of the policy number. You can’t in¬ 
sert a 19 in an insurance policy number. 

DDJ: And windowing? 

BB: Also flawed. For one thing it's evanes¬ 
cent. Another thing is a bt of people want 
sliding windows and by that time you 
don't know what's going on. 

And dien you've got diese 28 guys. You 
know that one? You know what happens? 
Well on IBJV1 equipment when you say 
pic99, boy, that’s always gonna lie a pos¬ 
itive number, right? Well, take my birthdate, 
1920, That's stored as 20, you subtract 28 
and you get minus eight and the compil¬ 
er says, 1 know diis value is always posi¬ 
tive but just to be safe, I'm gonna overwrite 
it to make it positive, and I have every right 
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to do that. And now I’m a plus eight, right? 
And on the way out, you add 28 back, and 
I was bom in 1936, Well. I can use the ex¬ 
tra years, but it’s just not gonna work. And 
there are people out there selling that. 

DDJz What you 1 vc developed is an object- 
code solution. You piggyback millenlum 
and century marks onto die two year bytes. 
As 1 understand it, you rweak the data in 
thus way and then you sidetrack the object 
code that deals with dates into subroutines 
that understand this tweaked data. But 
maybe you’d letter explain it. 

BB: WeVe made a microprogram that will 
operate on Big it arithmetic. Bigil is short 
for "Bemer digit." 


DDJz Bigitizing data is what I called tweak¬ 
ing it. How does that work? 

BB: We put a millenium century indicator 
over the decade digit so we can handle 
dates from 1600 up to 2099. The problem, 
of course, is that the hardware won’t han¬ 
dle it. So whenever we think we're going 
to come upon one of these things we say, 
hey we have to handle that by a subrou¬ 
tine. And we examine the object program. 
First we examine the opcode and if it says 
drive the printer, we say, hey we’re not 
doing any arithmetic on years. If it says 
add and it’s a decimal add, we derail that. 
But only if we look at the operands and 
they’re of appropriate length. If it’s an add 
of two 16-digit operands, we say that has 
nothing to do with years and we don’t de¬ 
rail it. Bui every time we come across an 
opcode that could possibly be an [ap¬ 
propriate] arithmetic instruction, we derail 
it to this subroutine and we put it itself in 
a table of derailed instructions. 

So when the object code runs, it hits this 
derail, indexes over the instructions in there, 
plays like a regular accumulator in a ma¬ 
chine, goes out and gets the operands, in¬ 
spects them, and if one of them has a Big¬ 
it and you’re making a comparison and the 
other doesn’t have a Bigit in it, it says, wait 
a minute. How can you be comparing a 
year value to a nonyear value? Maylie that 
should be a Bigit, too. 

When we get down to that point we ex¬ 
pand it to either 16, 17,18, 19, or 20, do the 
arithmetic, recompress it, put it back where 
the answer is supposed to go and then come 
back to die original machine. 1 describe this 
as following the Napoleonic code where 
you're guilty until proven innocent. 

But when you are proven innocent, 
when we Find that it's not [date arithmetic], 
we take it out of the loop and put the orig¬ 
inal instruction back where it was. And 
thus we’ve reduced any time we can. 
However we’ve got a lot of people who 
say we don’t care if it does add time; just 
iix it. But, as a matter of fact, we don’t 
add all that much time. Can’t detect it. 


DDJz And doesn’t the Bigitized data look 
the same as the old data to old unmodi¬ 
fied programs? 

BB- Thais another nice thing about this, as 
it applies to IBM equipment at least. They 
use jxicked decimal instructions, and when 
you pack a number only [one] zone gets 
[retained]. All the other zones disappear. If 
you run the old program against die old 
data, that’s what you had, right? If you run 
the enabled program against the old data, 
there isn’t any Bigit data there, so the pro¬ 
gram runs exactly as before. Now here’s 
the kteken When you run the old program 
against the new data, it doesn't see die 
decade zone because it gets tossed away 
in the packed decimal operation. And you 
still get the same wrong answer. Or if you 
were getting correct answers, no problem. 

What that means is that you can put up 
one application at a time. All the other ap¬ 
proaches make you change all your ap¬ 
plications and databases simultaneously. 

So, even if the first application Digitizes 
the data, the other applications can still 
am as they were. They may be wrong but 
at least they’re running. 

DDJi So where does the order of magni¬ 
tude advantage over other approaches 
come from? 

BB: We don’t touch the source program 
and we flu n't cause any errors to be made 
in the source program. So we don't have 
this 50 percent testing time that everylxxiy 
else has. We can test statistically. If it ran 
before there’s good odds it’ll run again if 
our device works properly. And instead 
of m users times n programs, we only 
have to check one program—ours. 

Now if your testing time goes down 
from 50 percent to 5 percent, you’ve got 
time before 20CX) to go through [the code) 
in case there’s any really weird stuff from 
programmer stupidity or clever tricks. 
DDJz Thanks Bob. 

Robs approach won’t help you if you’re 
working on PCs or with embedded sys¬ 
tems : It’s strictly a mainframe fix, at least 
so far. Hut that’s where Bob’s expertise is. 
There were a lot of things Bob and 1 didn't 
have time to get into, such as how his soft¬ 
ware figures out what data needs to lie 
Digitized, You can read Bob’s many arti¬ 
cles on the Year 2000 problem and his 
product, Vertex 2000, at the BMR Software 
web site (http^/www.bmrsoftware.com/). 

By the way, it’s now a little easier to re¬ 
member the URL for my web site, where 
I keep updated links, corrections, and 
reader feedback for this column. It’s http:// 
www.swaine.ajm/. And my new' e-mail ad¬ 
dress is m.swaine@swaine.com. 


DDJ 



Head Essential COM to 
ensure your survival 



Essential COM 



0-201-63446-5, 464 pages, £34,95 

Don Box, cofounder of DevelopMentor 
and well-known for training events like 
Guerrilla COM, knows COM can be 
challenging. That's why he wrote 
Essential COM. Don's book explains the 
why of COM not just the how. Master 
COM by learning to apply the model 
creatively and effectively - pick up 
Essential COM today! 

"Nobody knows COM 
better than Don Box." 
—Charlie Kindet, COM Guy, 
Microsoft Corporation 

"Don Box makes it possi¬ 
ble for mere mortals to 
join the COM cognoscenti. 
If you're a C++ COM pro¬ 
grammer, buy this book." 
-David Chappell, Principal, 
Chappell & Associates and 
author of Understanding 
ActiveX and OLE 

^ Addison-Wesley 


Check out the table of contents 
and sample chapters at: 
http:www.awl.com/cseng/ 
specials/ddj498.btml/ 

Find this book wherever 
technical books are sold or by 
calling (800) 822-6339. 
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C PROGRAMMING 

MIDIFile: Standard 
MIDI Format File Parsing 

Al Stevens 



R ecently, 1 undertook some projects 
that involved reading and processing 
data from files written in die Standard 
MIDI Format (SMF)—an architecture 
that, among other things, records MIDI 
messages in data streams, also called “se¬ 
quences.” Sequences of MIDI messages are 
w hat a sequencer program or device sends 
in real time to electronic musical instru¬ 
ments to play a production. SMF prescribes 
a standard formal to record the real-time 
messages on disk, as well as other infor¬ 
mation—tempo, for example—that a 
playback system needs to accurately re¬ 
produce the production, and information 
meaningful to the human MIDI program¬ 
mer— key signature, time signature, and 
so on. So, irrespective of a particular se¬ 
quencer's internal representation of these 
data, if tiie sequencer can generate an SMF 
file, other sequencers can play back die 
production. 

Furthermore, a computer program that 
can read an SMF file can process MIDI 
production data tn support of various ap¬ 
plications, which is the point of this dis¬ 
cussion. 

Each message in an SMF tile represents 
an event that a MIDI system processes in 
die course of die playback of a musical 
production. There are three kinds of events; 
MIDI events, metaevents, and system- 
exclusive events. Each event includes a 
time stamp that specifies when the event 
is to be processed. It is expressed as a 
delta number of dock ticks counted from 
the beginning of the track in which the 
event is stored. More about tracks later. 

MIDI events are real-time messages. 
They play notes on multiple simulated in* 
struments, specify "aftertouch” values, en¬ 
code controller events (volume, pan, and 
sustain pedal, for example), process pitch 
bends, assign channels to instrument 
sounds (patches), control lighting devices, 
and so on. 


Al is a DDJ contributing editor He can be 
contacted at astevens@ddj.com. 


Metaevents specify such things as tem¬ 
po, how to relate clock ticks to tempo, 
time signature, key signature, channel as¬ 
signments, and text fields to name the 
song, the instruments, and the tracks. 

System-exclusive events contain data 
defined by die manufacturer of the device 
or program that processes the SMF file. 

An SMF file comprises a header chunk 
followed by one or more track chunks. 
There are three SMF formats, 

• Format 0 is a single-track file with all 
the channel information and metaevents 
mixed in the single track, 

• Format 1, the most common format, 
contains a track of metaevents and one 
or more tracks that contain channel in¬ 
formation, Typically, each channel (in¬ 
strument) has its own track, but noth¬ 
ing says tiiat it must. Format 1 SMF file 
tracks are processed together. The delta 
lime fields of each are relative to the be¬ 
ginning of the production. 

• Format 2 is a multiple-track file where¬ 
in the tracks operate independently of 
one another™ each track can represent 
a different production. 

All of this suggests that die SMF archi¬ 
tecture is complex, bewildering, and beg¬ 
ging to be encapsulated so programmers 
can ignore the details and work at high¬ 
er levels of abstraction. 

MIDI tends to embrace an open, co¬ 
operative culture. Much of the software 
that MIDI users need is available as 
shareware or freeware. An Internet 
search revealed two freeware C-function 
libraries that parse MIDI files. One library 
is called Midifile (http://www.rH3such.conV 
midifile/midifde.zip). It is authored by 
Tim Thompson and augmented by Michael 
Czeiszperger. The other library is also 
called Midifile (http://www.oobfick.CQm/ 
software/midifile.tar.gz) and is a rewrite 
of Thompson's library by Andrew 
Arensburger. He rewrote the library be¬ 
cause he “wasn’t quite satisfied with 


[Thompson's! implementation.” Both li¬ 
braries were written in the 1980s in K&R 
C, and both work in essentially Lhe same 
way, with these differences: Arensburg- 
er’s rewrite does not include Czeiszperg- 
er’s enhancements to write SMF files; the 
rewrite does not support system-exclu¬ 
sive events; the rewrite sends error con¬ 
ditions to the library user whereas the 
original program simply exits the pro¬ 
gram upon encountering the first error. 
(Tim’s site at http://www.nosuch.com/ 
includes lots of other MIDI software that 
programmers might find interesting. In 
particular, check out his Key Kit MIDI 
progra naming envi ron ment.) 

The Midifile API of both libraries for 
parsing SMF files comprises a set of func¬ 
tion pointers. An application assigns the 
addresses of custom functions to the 
pointers and calls a library function to 
begin parsing the file. The library pro¬ 
cesses the file arid calls each function as 
the parser encounters the header, the 
tracks, and the events in the file. It is up 
to the using program to do something 
meaningful with the data. If the appli¬ 
cation does not initialize a pointer, the 
event associated with that pointer is 
parsed and bypassed. One notable fea¬ 
ture of the Midifile API is the require¬ 
ment for the application to manage open¬ 
ing, closing, and reading bytes from the 
SMF file. One of the function pointers 
expects to point to a user-provided func¬ 
tion that returns the next sequential byte 
in the file. 

Czeiszperger’s enhancements permit an 
application to write SMF fdes. You initial¬ 
ize pointers to write characters to the 
stream and to write tracks. Then you call 
a function that writes the header and calls 
your track function for each track. From 
w r idiin that function, you call functions to 
write events. 

Midifile contains all of the functionali¬ 
ty that I needed for my project and more, 
so I decided to use the two libraries as in¬ 
spiration for a rewrite in C++, 
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The MIDIFiie Class 

lies ides representing a complex problem 
domain in need of object-oriented en¬ 
capsulation, there are two facets of both 
Midifile implementations to which C++ is 
particularly well suited. First, error pro¬ 
cessing is better handled by C++ excep¬ 
tion handling than by the common C id¬ 
iom of returning error conditions. Second, 
the function pointer AIM mechanism Ls bet¬ 
ter represented by virtual functions in a 
C++ base class than by a set of global 
function pointers lhat the library user ini¬ 
tializes. With C++, the application derives 
a class from an abstract base class that en¬ 
capsulates the details of parsing the SMF 
file. The base class contains virtual mem¬ 
ber functions that the file parsing algo¬ 
rithm calls for each of the events. The de¬ 
rived class overrides only those virtual 
functions related to the events that the ap¬ 
plication wants to view. For example, a 
MIDI jukebox application would not need 
to see the time and key signature 
metaevents, but would need to see the 
tempo metaevent. The same application 
might want to display die Lexl that iden¬ 
tifies the production—the song title, for 
example, but would nol be interested in 
the instrument name text that identifies 
each track. 

The complete project — including all 
source code and a MIDI song file that one 
of the example programs creates— is 
available electronically (see “Resource Cen¬ 
ter, 11 page 3)- Midifile.li, for instance, is the 
header lile that defines the MIDIFiie class. 
MIDIFiie is an abstract base class. To use 
it for parsing or generating an SMF file, 
an application derives a class from it, pro¬ 
vides some arguments, and overrides some 
virtual functions. The MIDI File member 
functions are defined in a file named midi- 
file, epp, which is not published here l ie- 
cause of its length. 

The file begins with typedefc dial de¬ 
fine the aliases, Short and Long. The pur¬ 
pose for these typedefo is to ensure that 
certain data types are 16 and 32 bits in 
width, necessary data types for some of 
the internal SMF data representations. 
When you instantiate an object of a class 
derived from MIDI File, the constructor 
makes die following assertion: 

assert (sizeof (Short) =2&&£i2ieof (Uxig) = =4) “ 

Ifte next part of midifile.h declares the 
exception classes that MIDIFiie throws. First 
is a ^define macro named MFX that ab¬ 
breviates die declarations. Following that 
are invocations of MFX to declare classes 
derived from ski::mntime_error and to pio- 
vide text to describe 1 the nature of die ex¬ 
ceptions, (Whenever people deprecate the 
C++ preprocessor and wish it would go 
away, I think of handy constructions such 
as the MFX macro, and I smile,) 


A list of const d ecla r ations define sym¬ 
bols for the MIDI events, and an EueniBuffer 
class manages the allocation of memory to 
hold the data values of variable-length 
events. 

Next is the MIDIFiie class declaration. 
Objects of the class can lie instantiated to 
parse an existing SMF rile or to create a 
new one depending on which construc¬ 
tor you use. Each constructor initializes a 
pointer to either tin ifstream or an of si ream 
object and initialized the other pointer to 
zero. Subsequent member functions use 
these values to read or write the SMF tile 


MIDI events are 
real-time messages 


and also to determine whether the object 
was instantiated for input or output. The 
constructors and destructor are protected 
to ensure dial you derive a class from 
MIDIFiie rather than instantiating an ob¬ 
ject of MIDIFiie, (Well, almost; a class de¬ 
rived From MIDIFiie could itself instanti¬ 
ate a MIDIFiie object, but there would l^e 
no reason to do so*) 

MIDIFiie includes private member func¬ 
tions to manage the way that SMF files 
store numerical values. SMI" uses a byte 
order that is the reverse of that used by 
Intel processors, so some juggling is nec¬ 
essary when reading and writing those 
formats between memory and the file. 
Some values are stored in two bytes. The 
song's tempo is stored in three bytes. The 
header length and track length fields arc 
stored in four bytes. All values are treat 
ed as twos-complement signed integers* 
Event data lengths and delta times are 
stored in a variable-length format where¬ 
in all but tlie last byte lias the most sig¬ 
nificant bit set to one. Only bits 0-7 of 
the bytes in a variable-length integer con¬ 
tain numerical data. If you download the 
project, you can view these conversion 
functions in midifile,cpp. 

Reading an SMF File 

An application reads an SMF file by first 
deriving a class from MIDIFiie, overriding 
virtual functions that process each of the 
MIDIFiie components* The constructor of 
the derived class passes to the base MIDI¬ 
Fiie class constructor a reference to an open 
SMF file. The MIDIFiie constructor has a 
second argument, a bool that tells MIDIFiie 


whether to bypass system-exclusive events 
{true) or to allocate memory for them on 
die heap and process them (false), System- 
exclusive events can involve large memo¬ 
ry allocations. This mechanism permits a 
program to ignore them, 

The application opens the SMF file and 
instantiates an object of die derived class. 
The application then calls die MIDIFiie:: 
ReadMIDIFilei) function, which parses the 
file, calls the virtual functions for each 
event, and throws exceptions when it finds 
errors in die file. 

What the application does depends 
on which virtual functions it overrides 
and what il does with the SMF data. 
Miditest.cpp (available electronically) is 
an application that overloads all the vir¬ 
tual functions and displays the contents 
of the SMF file on the console. The pro¬ 
gram catches exceptions and displays their 
text on the console. As such, miditest is a 
good utility program for testing the valid¬ 
ity of SMF files and repotting what and 
where the problems are. Actually, if l had 
not written this program first, getting the 
next one to work and debugging die pan 
of Midifile that creates an SMF file would 
have been much more difficult. 

Each overridden function represents 
one part of the SMF file. The first func¬ 
tion called is the Header function* Next 
is the Start Track function for the first 
track. After that come the metaevent and 
MIDI-event functions for the track with 
the EndOfl'mck function signifying that 
there are no more events in the track. The 
track and event functions repeat for each 
track in the file. 

Each overridden function is passed a 
delta time value as its first argument. If die 
application overrides all the functions, the 
delta value is the same one recorded in 
the file with the events. Otherwise, the 
value passed is the delta time since the 
last event that was intercepted by an over¬ 
ridden function. An overridden function 
must not call the base class function it 
overrides. If it does, the delta time com¬ 
putation is compromised. 

Writing an SMF File 

An application writes an SMF file also by 
deriving a class from MIDIFiie. In this case 
the constructor passes to the MIDIFiie con¬ 
structor a reference to an of stream object 
and variables that specify the SMF format 
(0, 1, or 2), the number of tracks, and a 
value that specifies Flow many clock ticks 
there are in a quarter note. The derived 
class overrides the StartTrack virtual func¬ 
tion. The overriding function uses the track 
numiter to determine which track is to be 
written, and calls MIDIFiie member func¬ 
tions to write each track's events. 

Listing One (listing begins on page 
137) is playmidi.cpp, an application that 
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generates a simple SMF file that plays 
the first part of the “William 1'ell Over¬ 
ture” (no royalties to pay. that’s why) 
on the piano and drums. The SMF file 
is included in the files you can down¬ 
load with this project. The application 
derives the WmTellOverture class from 
MIDIFile as explained previously, opens 
the of stream file, instantiates an object 
of the class for the file, and calls the 
WriteMSyiFile function. Observe that die 
MIDI File functions that Wm Tell Over¬ 
ture: :StartTmck calls to write events to 
the track are the same virtual functions 
that an application overrides to read an 
existing SMF file, 

MiDifile Enhancements 

Even though the MlDIFile class encapsu¬ 
lates much of die SMF file processing, you 
still view it from a relatively low level, The 
application lias to know how to convert 
between musical and SMF notation. For 
example, an application lias to know that 
the denominator of a time signature is re¬ 
ally a power of two so that a march in 6/8 
time is expressed as 6,3 (six and two raised 
to the third power). I don’t know why, 
but that's how they did it. Key signature 
is a number lietween -1 and +7 to spec¬ 
ify the number of flats or sharps and a 
boolean integer to indicate whether the 
key is minor (1) or major (0). Tempo and 
event time is another enigma. The head¬ 
er specifies the number of delta Held dock 
ticks in a quarter note unless the number 
is negative, in which case it represents die 
number of ticks in a second. The tempo 
field itself is the number of microseconds 
in a quarter note. Notes are just numbers 
rather than in notation that musicians 
would understand. There's more, but I 
won't go into it here. The point is that to 
use Midifile, you have to understand tjiese 
algorithms, A class library at a higher lev¬ 
el of abstraction would understand musi¬ 
cal notation, implement that in the API, 
and do the conversions for you. As 1 use 
this library, I expect to understand better 
where further encapsulation will make it 
more usable, 

Tfie C++ Programming 
Language, Third Edition 

Bjame Stroustrup's The C++ Programming 
Language, Third Edition (Addison-Wesley, 
1997) has been available for several 
months. This work, by die creator of C++, 
is die definitive treatment of the subject 
and has been since its first edition in 1987. 
I must confess that I did not care for the 
first edition, I had expected a tutorial ap¬ 
proach as elegant as the classic K&R white 
book. But then, K&R was about C, a pro¬ 
gramming language that supported a fa¬ 
miliar programming model. The C++ pro¬ 
gramming model was new to most of us 


ten years ago, and Stroustrup's first edi¬ 
tion was daunting, to say the least. Ltjok¬ 
ing at it now, I find it far less so and much 
easier to read. 

Comparing the first and third editions 
of The C++ Programming Language pro¬ 
vides insight into how the C++ language 
has grown and changed in the past 
decade. The third edition has almost three 
times the number of pages and a slightly 
different organization. Whereas the first 
edition included a 67-page language ref¬ 
erence manual at the end, the third edi¬ 
tion includes only a language grammar 

MIDIFile includes 
private member 
functions to manage 
the way that 
SMF files store 
numerical values 


section to represent formal language def¬ 
inition. This is appropriate. The ANS1/1SO 
Standard document, which is now the for¬ 
mal language and library definition, is it¬ 
self about 750 pages king. Stroustrup plans 
to publish The Annotated C++ Language 
Standard (coauthored by Andrew Koenig, 
the ANSI C++ committee's Project Editor) 
sometime this year. 

The third edition takes a tutorial ap¬ 
proach with many of Stroustrup’s personal 
programming philosophies. The author’s 
explanations of how he uses language fea¬ 
tures provide examples for learning the 
behavior of those features. He also ex¬ 
plains code idioms that some program¬ 
mers routinely use but that he finds in¬ 
appropriate. 

As much as possible, the third edition 
reflects Standard C++. When small lan¬ 
guage features are Found to lie missing, 
particularly new ones, Stroustrup pledges 
to add them to a future printing, 

The book includes many code exam¬ 
ples. There is no diskette or CD-ROM , be¬ 
cause Stroustrup avoids a teaching ap¬ 
proach wherein readers compile and run 
examples. His examples are mostly code 
fragments that demonstrate the points he 
makes and the issues he addresses. The 
code fragments are readable, meaningful, 
and neither frivolous nor cute, and since 


you do not compile them, you need not 
worry that your compiler does not fully 
support Standard C++, 

There are four parts to the lx>dy of the 
book; Tan I: Basic Facilities;" Tart II; Ab¬ 
stract Mechanisms; 1 ’ “Part IN: The Standard 
Library;" and “Part IV; Design Using C++.” 
Even if you are already a seasoned C++ 
programmer, Pail IV, which is a rewrite of 
several chapters from the second edition, 
is worth the price of the book* It describes 
Stroustrup's philosophies cm the design 
and development cycle of a software pro¬ 
ject involving C++. In his words, Part IV 
aims “to bridge the gap between would- 
be language-independent design and pro¬ 
gramming that is myopically focused on 
details." 

The three appendixes are: U A: The C++ 
Grammar;” “B; Compatibility;” and tl C: 
Technicalities." Appendix B discusses the 
differences between C and C++ and ex¬ 
plains how the languages have become 
more compatible over time. Some of this 
convergence results from changes being 
made to the C specification (double-slash 
comments and no implicit ini for exam¬ 
ple), The appendix also discusses die is¬ 
sues related to porting C++ code from old¬ 
er C++ implementations, advising that, 
where possible, you should use the latest 
implementation of a compiler so that new¬ 
er features are available to you. 

Appendix C is about technical details 
that a programmer faces that are not nec¬ 
essarily language issues. 1 particularly like 
the discussion on the problems associat¬ 
ed with traditional multldimensioned ar¬ 
rays as compared to using STL containers 
to achieve the same result without the 
headaches. 

This book is an essential addition to 
a C++ programmer’s library. It is not for 
dummies, and iL wouldn't be my first 
choice for an entry-level, self-help tu¬ 
torial on C++ for beginning program¬ 
mers. It is, however, an excellent textbook 
for programmers who are self-motivated 
and students who sLudy under the 
watchful care of a skilled instructor. As 
an experienced C++ programmer, l find 
the book useful as a reference to lan¬ 
guage usage and behavior. The author 
invented the language and then stayed 
dose to the standardization and inno¬ 
vation process for the duration, always 
maintaining a careful vigilance over the 
evolution of his brainchild. Conse¬ 
quently, this book serves, for those who 
do noL care to pore over the ANSI/ISO 
document (or the promised annotated 
version), as the authority on the Stan¬ 
dard C++ language, how it works, and 
how you should use it. 

DDJ 

(Listing begins on page 137*) 
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JAVA Q& A _ 

How Do I Create a 
Streaming Audio Java Applet? 

L. Richard Moore 



S treaming audio refers to audio that can 
be downloaded aL the same speed it is 
played. If the download speed match¬ 
es or exceeds ilie rate at which die au¬ 
dio data is played, the audio can lx* played 
indefinitely without any interruptions* There 
are many practical applications for this tech¬ 
nology on the Internet, and numerous com¬ 
mercial applications in the streaming audio 
client/server market have emerged. In this 
article, TO present idtAudio, a streaming au¬ 
dio applet written in Java. (The source axle 
for idtAudio and the companion program, 
idtAudkxcpp, is available electronically; see 
“Resource Center," page 3.) 

As a Java applet, idtAudio inherits sever¬ 
al advantages over most other streaming au¬ 
dio tools. Its most distinct advantage Ls that 
users never need to install any software. As 
an applet, the code is downloaded auto¬ 
matically into the browser as needed with¬ 
out user intervention. Furthermore, since 
the code is downloaded for each audio in¬ 
stance, there Ls no possibility of version mis¬ 
matches between the audio data and the 
player. Also, applets are portable. The idt¬ 
Audio applet seems to run equally well on 
Netscape Navigator 3.01 and Microsoft In¬ 
ternet Explorer 3,01 on Windows 95/NT. 

The design of idtAudio also sports sev¬ 
eral distinctive features. The streaming 
protocol is built on standard HTTP, which 
is the same protocol used by all web 
browsers. As such, the applet can down¬ 
load its audio data through any firewall 
that allows web activity* This also means 
that the server has no requirements for 
special drivers or extensions. 

Compression 

Compression is the key to transmitting 
streaming audio data at low band widths. 
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Audio data is considered to be one of the 
most difficult mediums to compress, but 
w hen using Java, the algorithm must also 
lie very fast, 

1 researched several existing audio com¬ 
pression technologies, starting with MPEG* 
While its audio quality is excellent and 
several code samples am readily available, 
MPEGs psychoacoustic algorithm places 
a heavy toll on the CPU and cannot be 
implemented effectively in Java. 

ADFCM is a simpler algorithm that would 
seem more likely to work in Lite Java en¬ 
vironment, Unfortunately, ADPCM does not 
easily compress to the levels required for 
streaming modem transmission. The qual¬ 
ity becomes poor, and the specific algo¬ 
rithms used become more secretive. 

Other algorithms had similar problems, 
so 1 was forced to develop my own. Be¬ 
cause compression relies on predictable 
patterns to lie effective, I began by study¬ 
ing graphs of w ave forms (see Figure I). 
The sinusoidal qualities of the wave sug¬ 
gested a first algorithm: Compress by cap¬ 
turing the peaks and valleys only, and de¬ 
compress by programmatically filling in 
the remaining points using a sinusoidal 
function. With this algorithm, the decom¬ 
pressed audio had an annoying echo ef¬ 
fect, More importantly, 1 aLso realized that 
die algorithm was uncompromising in that 
it could never predictably compress to any 
particular ratio. For example* a wave com¬ 
posed entirely of peaks and valleys would 
double in size after I icing "compressed," 

A common technique used in modem 
physics is "curve fitting/’ where empirical 
data is approximated to a mathematical 
function such as a line or a paralxila. My 
next attempt at audio compression broke 
Up the wave form into a series of straight 
lines that approximate the original The au¬ 
dio quality was much improved, and the 
compression ratio could lie controlled by 
how closely the original wave form w as 
followed* Tills tolerance level could be it¬ 
eratively increased until the desired com¬ 
pression w as achieved. The only problem 
was that* to achieve the required com¬ 


pression ratios, the tolerances had to lie set 
very high. This degraded the audio quali¬ 
ty too much. \ still needed a way to in¬ 
crease 1 the compression without sacrificing 
sound quality* 

A frequency analysis of the audio data 
showed that smaller values occur much 
more regularly than larger ones, so it seems 
natural to sacrifice precision at higher am¬ 
plitudes and preserve more detail near the 
smaller amplitudes. Originally, each audio 
sample is a signed eight-bit byte. idtAu¬ 
dio further compresses the audio data by 
storing a four-bit value for the amplitude 
instead of the full eight bits. The four-bit 
signed value Ls the square rexjt of the eight- 
hit value (with negative amplitudes repre¬ 
sented by negative square roots). Right or 
wrong* this extra 33 percent of compres¬ 
sion was enough to make idtAudio work. 

Although the audio compression pro¬ 
gram, enccxle*exe T was originally written 
in Java, performance was poor on large 
files, so 1 rewrote it in C. The C version 
seems quick enough for streaming en¬ 
coding, Thus, we have laid the founda¬ 
tions for an Internet telephone, but that 
would be a different article entirely,*, 

Decoding and Playing 
the Compressed Audto 

Due to Java's poor audio support, the au¬ 
dio player applet requires the use of Sun’s 
unsupported Audio Player class in the 
sun.audio class library. (It is this depen¬ 
dency that prevents idtAudio from work¬ 
ing with every java-capable browser*) The 
AudioPktyer class requires an InputStrmm 
object from which it reads and plays ulaw- 
enccxled audio data until iL reaches the end 
of file. The idlAudio applet derives its own 
class from InputStream called Fifolnput- 
Stream fwhich wall be referred to as FIS 
for reasons I’ll explain shortly), and sul> 
mits it to the AudioPlayer class* FIS allows 
one write thread and one read thread to 
communicate with fairly good performance. 

A simple architectural solution* then* 
would have one thread dow nloading* de¬ 
coding. and depositing audio data into the 
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FIS queue ? while another thread (imple¬ 
mented via AudioPlayer) would read and 
play the audio data; see Figure 2. This de¬ 
sign worked well when the compressed 
audio data was read from a local hard disk. 
Streaming from a true Internet URL was a 
different story. The audio was broken up 
and my modem lights indicated a lack of 
activity. Some quick tests of URL download 
times with C and Java programs suggested 
that the server's performance was not an 
issue, and that the Java language, itself, was 
not an issue. Clearly, the simple design de¬ 
scribed earlier was not an optimal solution. 

Because a serial download leaves the 
CPU mostly idle, it made sense to break 
up the download and decode tasks into 
separate threads; see Figure 3 The FIS class 
was the perfect way for the two threads to 
communicate. A DoT (short for ' download 
thread”) class, derived from Thread, would 
lie responsible for downloading the audio 
data and depositing it into the first FIS ob¬ 
ject. The DeT (short for “decode thread") 
class, also derived from Thread , would be 
responsible for reading the first FIS object, 
decoding the data, and depositing the re¬ 
sults into the second FIS. The AudioPlay- 
er class would then read and play the sec¬ 
ond FIS at its leisure. Tills solution proved 
to be both successful and elegant. 

Optimizing Java Code 

Some serious optimization was still re¬ 
quired to make the player perform on a 
486 running Netscape's slightly slower im¬ 
plementation of Java. 

First, the synch ronized sections of code 
were minimized. Some basic common 
sense pays off here. Consider die nature 
of our FIFO queue: One thread reads at 
the tail and another writes to the head. 
The only point of contention is the full 
member that indicates how full the queue 
is. Write operations to this memlier are 
synchronized. Read operations are not 
synchronized: This does cany a slight risk, 
but the risk only occurs under an unusu¬ 
al set of circumstances and is considered 
acceptable for the performance gain. 

Every time an array is accessed In Java, 
the language performs some internal 
bounds checking. This checking can be 
costly. Not only was array access mini¬ 
mized, but looping itself was also mini¬ 
mized. Buffered access to the FIS objects 
was encouraged wherever possible in fa¬ 
vor of character access. 

Probably the most significant perfor¬ 
mance boost came from the System.ar- 
raycopyO function, which copies one ar¬ 
ray’s contents to another, given offsets and 
lengths into each. System.arraycopyO is 
analogous to niemcpyi ) in C and similar 
in performance. Sadly, the only alterna¬ 
tive in Java is to use a standard loop and 
iteratively copy one member at a time. 


Instead of a circular FIFO, I considered 
using a linked list of dynamically allocat¬ 
ed buffers. When data was entered into a 
FIFO, a new buffer would be allocated 
and linked to the head of die list. Reading 
would retrieve and unlink tail-end buffers. 
Such a scheme could avoid one of two 
calls to $ystem.arnzywpy(X but would add 
the expense of garbage collection and ol> 
ject creation. A couple of test programs 
showed that System.arraycopyO was still 
slightly faster dian the linked list scheme, 
so I never implemented the change. 
Information on Java optimization is 
sketchy at best, Most texts suggest the 
generic approach of optimizing the In¬ 
nermost loops. The tricks I’ve just de¬ 
scribed showed remarkable performance 
gains, however. These techniques can be 
applied to almost any program. 

Protecting Java Code 

You may wonder why I chose such cryp¬ 
tic naming conventions lor my classes. Orig¬ 
inally, they were called FifolnputStream, 
DecodeThread, and Doumtoadlhread, but 
the destiny of my applet was uncertain at 
diat time. Because Java classes are down¬ 
loaded by name, anybody using the audio 
player could quickly discover some key 
design secrets! The names were too infor¬ 
mative, so 1 renamed them to FIS, DeT, and 
Dot, names that would be meaningless to 
anyone who didn't already understand the 
applet’s internal architecture. 

Another protection mechanism demon¬ 
strated by idtAudio is tire “time-bomb” fea¬ 
ture often seen in shareware programs, 
(The code is commented out to effective¬ 
ly disable die time-bomb feature, but re¬ 
mains for your reference,) Typically, an ex¬ 
piring license scheme is nol very effective 
in Java applets because a web developer 
could easily download a licensed copy of 
the applet from another web site. The trick 
to making tills work is to place the a bomb M 
in the audio data instead of die applet it¬ 
self. By doing so, die key to the license is 
embedded in data that is useless to every¬ 
one but the web site owner who owns the 
license. This is accomplished rather simply 
by placing a time stamp within the head¬ 
er information. The applet checks the time 
stamp and refuses to play encoded files 
that are too old. When registered, howev¬ 
er, die encoder inserts null data into die 
time stamp slot, which indicates that die 
encoded audio file never expires. 

Using idtAudio 

The first step to using the idtAudio applet 
is to create your audio files, 1 use Syntrii- 
lium’s Cool Edit 96 to create mine; even 
the disabled shareware version has enough 
features to do the job, Record an audio 
session and save it as an 8000 MHz eight- 
bit sam file. 



Figure 1: Graph of a typical 
audio wave , 



Figure 2: Block diagram of first 
applet algorithm. 



Figure 3: Block diagram of second 
and final applet algorithm , 
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Next, the audio File needs to be com¬ 
pressed, Use the encode, exe program 
for this. It requires an input File and an 
output file for command-line parame¬ 
ters. An optional diird parameter is ei¬ 
ther a tolerance level or a target band¬ 
width (in characters per second), 


depending upon the value given. If a 
tolerance is given, the audio file is com¬ 
pressed once at the specified tolerance. 
If a target bandwidth is specified, the 
audio file is compressed iteratively at in¬ 
creasing tolerances until the target 
bandwidth is reached. 


My experiments suggest that a 28.8 
modem connected at 24000 bps can sus¬ 
tain transmission speeds of approxi¬ 
mately 2300 cps, so this is the default 
target bandwidth when none is speci¬ 
fied. A 14,4 modem can only sustain 
bandwidths of about 1400 cps. Some ex¬ 
perimentation has uncovered a rule of 
thumb: Under good conditions with PPP, 
a modem's bandwidth in characters per 
second is approximately the modem's 
connection speed in bits per second di¬ 
vided by 10, To be safe, I usually sub¬ 
tract another 100 cps. 

After the encoded audio file is creat¬ 
ed, it must be stored on your web serv¬ 
er. You may wish to use a familiar MIME 
type if your server is picky about such 
things. The four applet classes for the 
player (De7\ Dol\ PIS , and idlAudio) must 
also be stored on the server. Add Exam¬ 
ple J to your HTML code to add the id- 
tAudio applet. 

In this example, Tish.idt” would be the 
name of ihe encoded audio file. The De¬ 
bug parameter sets the style of the applet's 
appearance; true provides more detailed 
information. The AutoPlay parameter in¬ 
dicates whether the applet should begin 
playing immediately or wait for the user 
to dick on it. 

Conclusion 

in retrospect, there are a number of vari¬ 
ations to the compression algorithm that 
could Increase compression and quality. 
You could even implement an “algorithm 
shifting” scheme dial would change com¬ 
pression algorithms on ihe fly based on 
an analysis of the audio data. 

Simpler improvements would be to 
take the amplitude compression into ac¬ 
count when calculating tolerances. The 
amplitude compression introduces sub¬ 
stantial error that is not considered by 
the tolerance level. Also, the amplitudes 
might be better stored as offsets instead 
of absolutes, similar to the ADPCM 
method. 

Nevertheless, the idtAudio applet ex¬ 
emplifies a number of concepts. First, it 
is a good example of object-oriented pro¬ 
gramming. In fact. I believe every object- 
oriented programming principle is demon¬ 
strated in some way within the applet's 
code. Next, it demonstrates the power of 
Java. Consider that the total code size is 
only 17 KB yet it implements a multi¬ 
threaded graphical applet with perfor¬ 
mance good enough to perform down¬ 
loading, number crunching, and real-time 
audio on a 486 machine. Lastly, it demon¬ 
strates a basic security and licensing 
scheme that developers may want to ap¬ 
ply to commercial applets. 

DDJ 


<applet code=“idtAudio,class" align="baseline" width* 
<param naine= w AndioStrefliiitM J B valuer*fish.idt 1 ^ 

<par am name="AutoPlay" value 1 *" false* > 

<param name*"Debug" value**true*> 

</applet> 


height*"30" id*”idtAudio"> 


Example 1: Adding the idtAudio applet , 
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ALGORITHM ALLEY 

Sorting and Searching 
Linked Lists in Java 

John Boyer 



Generic collection classes, such as those in the C++ Standard 
Template Library and java Developer s Kit, are nice to have 
around, in part because they encapsulate standard algorithms 
that are useful in many situations. But such libraries are hard 
to write—how do you know, for instance, that the particular 
algorithm you chose will work well across so many applica¬ 
tions? As you’ll see upon reading John's article, even experts 


such as those writing the JDK can sometimes overlook the 
best algorithm choices. 

J found John's discussion of linked list searching especial¬ 
ly interesting. Like many people, I failed to distinguish be¬ 
tween the cost of walking a link and the cost of a key com¬ 
parison. Understanding that distinction makes a big difference. 

—Tim Kientzle 


S ome common misconceptions in 
computing are that arrays can be sort¬ 
ed more quickly than linked lists; that 
the quicksort is the fastest compari¬ 
son sort on average; and that binary 
searching a linked list provides no bene¬ 
fit over a sequential search because both 
are 0(n). 

If you agree with any of the aforemen¬ 
tioned views, you're in good company. 
Hie list sort and search algorithms in ihe 
recently released Java Developeris Kit 
(JDK) L2 beta 2 are based on these mis¬ 
conceptions, which results in suboptimal 
performance. 

In this article, I'll present Java imple¬ 
mentations (available electronically, see '’Re¬ 
source Center,” page 3) of more-efficient 
list sort and search algorithms, which are 
based, in part, on my undergraduate the¬ 
sis work in 1988. These algorithms also 
have the advantage that they work on both 
singly and doubly linked lists (although the 
sorts require a simple post-processing loop 
for doubly linked lists to reconnect all of 
the references to previous nodes). 

Performance 

Although T collected empirical results on 
a 64-MB 200-MH2 Pentium running Win¬ 
dows 95,1 also ran the tests on other ma- 
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chines. While the numbers change, the 
conclusions are the same. 

The test program allows you to spec¬ 
ify the number of randomly generated 
ten-character strings to sort or search. As 
you type your number, the program will 
not show you what you are typing. This 
is a bug in java's Kbd.readLineC) func¬ 
tion. However, if you simply type your 
number on faith and press Enter, your in¬ 
put will be properly recognized. You can 
then choose a sort or search test. The sort 
test runs five different sort functions: the 
array quicksort in Arrays.$ort(), the linked- 
list quicksort in Collect ions.sorti), a new 
linked-list quicksort, a new linked-list 
merge sort, and a non recursive version 
of the linked-list merge sort. The search 
test compares the new binary search to 
the sequential search in Collections,hi- 
fi a rySea rch ( ). T he sea rch Les is si in pi y 
use the search to find every siring in the 
array or list. 

Figure I shows the results of the sort 
tests on 16,000 strings. The limitations of 
Systcrn cu rren tTimeMilUsf ) (whieh is 
only updated eveiy 50 to 60 milliseconds 
on Windows 95) make it difficult to dis¬ 
tinguish the two list merge sorts. With 
32,000 strings, the Java VM froze with¬ 
out any errors, so I was unable to em¬ 
pirically test which of my two new lust 
sorts was faster. 

All of die new linked-list functions sort 
linked lists faster than Arrays.sortf } can 
sort the equivalent array. Since Collec¬ 
tions sortf ) calls ArrayssorK f naturally 
all of the new linked-list sons are much 


faster than the JDK’s linked-list sort. This 
was not just on average but in every test 
case. The most notable result is dial die 
JDKvl .2b2 Collections.sortO was 33 per¬ 
cent slower than the recursive merge sort, 
and the Arrays.sortf) function was slow¬ 
er than the linked-list merge sort by 22 
percent. Best of all, the merge sort can¬ 
not degrade to quadratic time for any data 
set, Thus, it is not true that linked-list sort¬ 
ing must be slower than array sorting, nor 
is it true that the quicksort is the fastest 
comparison sort. 

As Figure 2 shows, the linked-list bi¬ 
nary search was over nine times faster 
on average than the sequential search 
using 1000 to 4000 randomly generated 
strings. The shapes for both searches are 
parabolic because die searches are Ofn), 
and the test program searches for every 
string in the list, yielding Q(n 2 ) test per¬ 
formance. 

Small Errors, Big Impact 

When first testing Collections.sort() } 1 no¬ 
ticed it was getting results that were poor¬ 
er than I expected—140 seconds for 16,000 
strings. Collections,sotK } only adds a few 
linear time operations to an CXn log n) ar¬ 
ray sort, so it shouldn't be much slower 
tlian array sorting. 

The first step of Collections,sortt) is to 
convert die linked list Lo an array; this was 
taking 136 seconds in my first test. The 
toArrayi ) function itself turned out to lie 
0(n 2 X apparently because the iterator's 
next() function in LinkedUst restarted at 
the beginning of die list in each call. As 
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a temporary fix, my test program sub¬ 
classes LinkedList 10 overload the itera¬ 
tor) function. The new version merely 
returns listJterator(), with a substantial 
performance improvement, 

Linked-iist Quicksort 

It may come as something of a surprise 
that the quicksort can be done on a singly 
linked list since it is impossible to effi¬ 
ciently traverse backwards. In fact, I 
haven’t encountered any other work de¬ 
scribing a linked-iist partition sort, so I be¬ 
lieve that this is an original algorithm 
(though. Fd be glad to hear from any read¬ 
er who knows otherwise). Listing One 
(listings begin on page 137) is Java sou rce 
for this new sort. 

The basic method uses the first node’s 
key as the pivot. The algorithm travers¬ 
es forward through the list using two ref¬ 
erences called a Node and aNodePrev, 
Nodes with a key value greater than or 
equal to the pivot are ignored. When a 
node containing a lesser key is encoun¬ 
tered, the algorithm deletes aNode by 
connecting aNodePrev, next to aNode, next. 
Then, aNode is pushed onto the front 
of the list, becoming the new first node. 
Once the list is divided into two sub¬ 
lists, it can be sorted by recursively ap¬ 
plying this partitioning procedure to 
each suhlist. 

This still leaves the quicksort's prob¬ 
lem of quadratic performance on sort¬ 
ed/nearly sorted and reversed/nearly re- 
versed data. The simplest and most 
popular solution to this problem is the 
median of three method, which can be 
effected in constant time per subarray 
partition. However, the median of three 
method requires linear time to scan the 
cnlire linked list prior to the partition 
pass. Truthfully, this really isn't such a 
bad idea. Since the partition already takes 
linear time, having the median of three 
method take O(n) time doesn't change 
die 0( fj log n) average case rating of the 
sort. The demo program (available elec¬ 
tronically) provides a median of three 
function for the list quicksort. 

But if linear time is acceptable, then it 
is possible to do something more elegant. 
In the case of die linked-iist quicksort, the 
pivot advancement method (see Figure 3) 
is much shorter to code and easier to un¬ 
derstand, yet it results in much faster per¬ 
formance on random data as well as O(n), 
overall sort behavior on nearly sorted and 
nearly reversed lists. 

Pivot advancement starts with two ref¬ 
erences, Pivot and aNode, at the start of 
the list. The algorithm advances aNode 
through the list as long as it is mono- 
tonic nondecreasing, and every two ad¬ 
vancements of aNode yields one ad¬ 
vancement for the Pivot. If the pivot 


advancement loop traverses the entire 
sublist, then it is sorted so the function 
returns with no further processing. If the 
pivot advancement loop does not tra¬ 
verse the entire loop, then the pivot is 
the median element of the entire mono¬ 
tonic nondecreasing sublist at the start 
of the input list. Furthermore, we leave 
aNode at its given position rather than 
starting the partition loop at the node af¬ 
ter the pivot t>ecause the nodes between 


The quicksort is not 
the quickest 
comparison sort. 
The merge sort is 


the Pivot node and aNode are already 
known not to have lesser key values than 
the pivot. 

Thus, with only one extra comparison 
per partition, pivot advancement handles 
nearly sorted data sets in linear time 
rather than quadratic time. Reversed data 
is also handled in linear time because 
nodes pulled from the second sublist are 
pushed onto the front of the first sub¬ 
list, which reverses them into sorted or¬ 
der. On random data, pivot advancement 
outshines median of three because the 
former does little work in choosing a piv¬ 
ot at or near the beginning of the list 
(rather than traversing the whole list). 
The probability that aNode will be ad¬ 
vanced in random data sets diminishes 
exponentially (1 in 2 i chances of i ad¬ 
vancements). 

Finally, note that die linked-iist quicksort 
presented here still has a quadratic worst 
case. The worst case occurs with sequences 
with die pattern 3,2,1,6,5,4,9,8,7.,, Howev¬ 
er, these worst case examples are unlikely 
to occur in practice, and die expected run¬ 
ning time is G(n log n) with low hidden 
constants, 

Linked-Ltst Merge Sort 

Conventional wisdom has it that the 
quicksort is quicker than all other com¬ 
parison sorts, and the merge sort in par¬ 
ticular, However, this conclusion is gen¬ 
erally based on analyzing die merge sort's 


performance on an array, which is not 
die ideal data structure for die merge sort. 
The merge of two sorted arrays requires 
an auxiliary array, which ultimately re¬ 
sults in the merge sort performing twice 
as many movement operations as the 
quicksort on average. By comparison, the 
linked-iist merge operation can be done 
in-place with only constant memory over¬ 
head for a few temporary variables. This 
eliminates both of the detracting factors 
for the merge sort. In fact, since the 
merge sort performs fewer key compar¬ 
isons than die quicksort on average, it is 
the merge sort that should be quicker 
when sorting linked lists. 

As the empirical results show, the 
linked-iist merge sort (see Listing Two) 
on a linked list is so fast that it outper¬ 
forms the quicksort in Arrays.sort() us¬ 
ing the same data placed in an array. The 
list quicksort presented in this article is 
also slower than die list merge sort by an 
average 6 percent. Furthermore, the 
merge sort's recursive calls always divide 
the list evenly in half. So, given that the 
merge step is a simple linear time pro¬ 
cess, the merge sort can never be slow¬ 
er than 0(n log n). Finally, the guaran¬ 
teed logarithmic function call depth 
means that the merge sort w ill never gen¬ 
erate a stack overflow exception, A worst 
case array quicksort could generate an 
exception in Java with as few as 5000 el¬ 
ements, which is die current size limit on 
Java’s function call stack. 

Nonrecursive Merge Sort 

It is also commonly believed that non¬ 
recursive sorts are better than recursive 
ones. Again, this is due to the quicksort. 
It is better to use a nonrecursive quick¬ 
sort simply to guarantee that no stack 
overflow exception will occur if a worst- 
case data set is encountered. The merge 
sort doesn’t have this problem, so re¬ 
cursion poses no threat. However, a non- 
recursive version does avoid function 
calls, so I coded two nonrecursive meige 
sorts. To my surprise, neither of the non- 
recursive solutions managed to outper¬ 
form recursive merge sort. 

The first nonrecursive method traverses 
the list merging all lists of sizes 2, then 
4, 8, and so on. It properly handles un¬ 
even cases with a minimum of overhead 
(see Listing Three). The second method 
is slightly faster, so its times are the ones 
reported in Figure 1, It is much more 
complicated. The idea was inspired by 
the consolidation step of the Fibonacci 
heap. The idea is to build an array of 
lists, all initially empty. Each list is re¬ 
sponsible for holding the number of 
nodes in successive powers of 2. You 
add the nodes from the original list one 
at a time. Then you add nodes, one at a 
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time, to the 2° list. That list overflows on 
the second addition, ai which point it is 
moved to the 2 1 list, emptying the 2° list. 
Then, if you add another two nodes, the 
2° list overflows again. But this time you 
have to merge it with the existing 2 1 list. 
When done, the 2 l list has four elements, 
so it is moved to the 2 1 list. The proce¬ 
dure is conceptually similar to incre¬ 
menting a binary number. If there were 
15 nudes in total, then the first four lists 
(2 W , 2 1 , 2 2 , and 2 3 ) would be full. If you 
then add a 16th node, then successive 
merge operations would join all nodes 
into the 2 4 list, emptying ail other lists 
in the process. 

Though more elaborate, this method 
is faster than the first nonrecursive merge 
sort because it does not traverse any sub- 
list twice, 1 also built a second merge 
function because this sort didn’t need 
some of the parameters and extra work 
of the recursive sort's merge function. 
Despite this, the second nonrecursive 
merge sort did not run faster than the 
recursive merge sort. The empirical re¬ 
sults in Figure 1 show the nonrecursive 



Figure 1; Sort comparison. 
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merge sort to be 0.3 percent slower than 
the recursive version, buL it is really too 
close to call. 

Sort Stability 

The merge son provides stability, a fea¬ 
ture that the quicksort doesn't provide 
without extra programming (including 
the army sort in the JDK). Stability in this 
case refers to a sort's ability to keep 
equivalent keys in the same order that 
they appeared in the unsorted data. The 
test program has a few r lines of com¬ 
mented code that can lie uncommented 
to demonstrate stability. The test simply 
adds the field Pos to each node. Before 
sorting, the numeric position of each 
node is recorded* After the sort, the key 
data and Pos value of each node is print¬ 
ed, After the merge son, the position val¬ 
ues for equivalent keys are always as¬ 
cending, but they are in random orders 
for the list quicksort. To make a stable 
quicksort, you could include Pm as a sec¬ 
ondary key, but this slows die sort and 
requires an extra field in the node struc¬ 
ture, With the merge sort, stability is free. 

Binary Search 

According to the empirical results, the 
linked-list binary search (Listing Four) 
was the biggest winner. While It's true 
that a binary search requires sorted or¬ 
der, this is often required anyway. And 
even when it’s not, the results show r that, 
in java, the cost of sorting is paid for 
with relatively few searches (for exam¬ 
ple, about 25 searches on a 4000 ele¬ 
ment list). 

The algorithm starts by setting Parti- 
HonFirst equal to the first list element 
and PartitionSize equal to tire number 
of list nodes, to each iteration of the out¬ 
er loop, the inner loop traverses to the 
list’s middle node* If the search key is 
less than the key at the middle node, 
dien the search should continue in die 
first half of the list. So, the algorithm 
merely reduces the PartitionSize because 
the PartitionFirst already refers to the 
start of the desired sublist. If the search 
key is greater than the middle node’s 
key, then PartitionFirst is set equal to 
die node after the middle node, and the 
PartitionSize is set equal to die count of 
nodes after the middle node. 

On average, die sequential search does 
n/2 node hops and n/2 comparisons. 
The number of node hops in a binary 
search is never less than n/2 but is al¬ 
most always at or near n , Despite the 
fact that the binary search does nearly 
twice as many node hops, the binary 
search is over nine times faster than a 
sequential search because the binary 
search does only log(n) comparisons. 
The linked-list binary search pays for it- 


Figure 3: Singly linked list quicksort. 

self if key comparisons are more com¬ 
plicated than two or three node hops. 
This is true of many data sets—for ex¬ 
ample, search by name requires string 
comparisons— but key comparisons are 
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UNDOCUMENTED CORNER 


VME: Coming Out of the Cold 

Robert R. Collins 


I n the first two installments of this se¬ 
ries on Intel's Virtual Mode Extensions 
(VME), I examined why VME is need¬ 
ed, More specifically, in die January 
column I explained the performance 
problems associated with Intel’s original 
v86 mode implementation. I went on to 
discuss Intel’s new Enhanced v86 Mode 
implementation, in particular, how Ev86 
mode fixed many of the performance de¬ 
ficiencies in the first v86-mode imple¬ 
mentation. 

In the March installment, I went under 
the hood of VME, examining all of the 
various components which are used by 
the VME, and explained how they work. 
At the end of that column, l presented 
source code that can lie used to model 
your own Ev86 mode tasks. 

Subsequent to those columns being 
pu blished, I ntel re 3 eased the In tel A rcb i- 
tecture Software Developer's Manual, Vol¬ 
ume 3- Although six months late, the man¬ 
ual couldn't have Ixeen released at a better 
time because it gave me the opportunity 
to determine whether Lntel had finally doc¬ 
umented the missing details of VME or de¬ 
scribed its many caveats. 

Alas, Intel 1ms done neither The missing 
details are still missing, and the caveats of 
VME have never been descril^ed. Notw ith¬ 
standing these omissions-, Intel lias actual¬ 
ly removed some vital VME information. 

This month, among other things, I'll 
show how many references to VMEs re¬ 
mained in the Pentium manuals after In¬ 
tel’s lawyers had forced their removal. 
These oversights allowed the Appendix H 
liberation movement to reverse engineer 
the VME details and release the informa¬ 
tion six months before Intel. 

The Appendix H Liberation Movement 

After the creation of the notorious Ap¬ 
pendix H, you would have thought Intel 


Robert is an independent consultant on 
the x86 architecture. He can he reached 
at rcollins@x86.org. 


had removed every reference to the ' se¬ 
cret' 1 VMEs (see my November 1997 col¬ 
umn for a description of Appendix H). 
However, Intel wasn’t careful enough. In 
fact, I found many references to VME scat¬ 
tered throughout the Pentium manuals. 

When preparing to help reverse engi¬ 
neer the VME details, I started with the 
premise that the Pentium manual was vir¬ 
tually silent on the subject of VME, In my 
research, I found many more references 
than expected. I extracted all the refer¬ 
ences and began categorizing them, As 1 
rearranged die quotes into categories, 1 
found the Pentium manual told its own 
VME story. Tills story starts by listing some 
of the problems with the existing v86 
mode. Next, the story establishes that new' 
VMEs were implemented to enhance per¬ 
formance. By the end of the story, the 
manual tells you virtually every aspect of 
VME. From this information, any capably 
equipped engineer can write code Lo fig¬ 
ure out the various implementation details. 

Here’s the Pentium manual’s own sto¬ 
ry of why VME was needed, and how to 
use it. Unless otherwise noted, all of the 
following quotes are taken from the Pen¬ 
tium Processor Family Developer's Manu¬ 
al Vain me 3- A rcb iteciure a nd Progra m - 
ming Manual (Intel part number 241430). 

• "Many 8086 programs written for non- 
multitasking systems set and clear the 
IF flag to control interrupts. This may 
cause problems in a multitasking envi¬ 
ronment. As a result, virtual monitors 
running on the Intel386 and Intel486 
processors require maintaining a virtu¬ 
al interrupt flag in software. All instruc¬ 
tions affecting the IF flag trap to the 
virtual-8086 monitor for emulation on 
these processors. 11 (Section 22.5.) Trap¬ 
ping Lo the v86 monitor causes consid¬ 
erable microprocessor time to be spent 
in the overhead of faulting and returning 
control to the program under emulation. 

* “The Pentium Processor includes ex¬ 
tensions to its virtual-8086 mode of op¬ 



eration that improve the performance 
of applications by eliminating the over¬ 
head of faulting to a virtual-8086 mon¬ 
itor for emulation of certain operations," 
(Chapter 22.) 

* “By enabling the virtual mode exten¬ 
sions, the virtual-8086 mode perfor¬ 
mance of the Pentium processor is sig¬ 
nificantly improved.” (Section 22.10,) 

* “Existing interrupt Hag sensitive in¬ 
structions provide significant perfor¬ 
mance improvement when using the vir¬ 
tu al mode extensions of the Pentium 
processor." (Section 22.1,1.) 

* The interrupt flag sensitive instructions 
which “eliminate die overhead of fault¬ 
ing to a virtual-8086 monitor for emu¬ 
lation 11 are CLI, STI, PUSHF, POPF, I NT 
n, I RET, (Section 10X39 

* “When TOPI =3, interrupts are serviced 
by he protected-mode interrupt service 
routine in a manner compatible with the 
Intel486 processor. On the Intel386 and 
Intel486 processors, all INT n instruc¬ 
tions running in virtual-8086 mode re¬ 
quire interception by the virtual-8086 
monitor when IQPL is less than 3, For 
information on Pentium processor vir¬ 
tual mode extension support of inter¬ 
rupt handling, see Appendix H." (Sec¬ 
tion 22,6.) This statement is very 
significant because it essentially tells the 
reader that TNT-ra instructions running 
in Pentium Ev86 mode aren't required 
to be intercepted by the virtual-8086 
monitor. 

* Setting CR4.VME=1 “enables support for 
a virtual interrupt flag in virtual-8086 
mode.” (Section 10.1.3, Section 22,10.) 

* “The Pentium processor T5S contains 
additional information used in virtual- 
8086 mode by the virtual mode exten¬ 
sions to the Pentium processor," (Sec¬ 
tion 23.2.17.1.) 

* “The fields of a TSS are divided into two 
main categories:..,(We are only con¬ 
cerned with the second category.) 2. 
Static Fields the processor reads, but 
does not change. These fields are set 
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up when a task is created. These fields 
store:...The base address for the I/O 
permission bit map and interrupt bit 
map. The base address points to the be¬ 
ginning of the I/O map and the end of 
the 32-byte interrupt map...See Chap¬ 
ter 22 for more information about the 
interrupt redirection." (Section 13.1.) This 
statement clearly tells us that there is a 
32-byte interrupt redirection bit map 
stored in the TSS, whose tail is pointed 
to by the I/O permission bit map base 
address, 

• The EFLAGS register contains two flag 
bits (VIF, VIP) used by VME. (Section 
3.3*4* I, Section 10,1.1, Section 23,2.9, 
Section 23.2.9.1. Lns tru ction description 
for IRET and POPF.) 

• VIF and VIP will always be clear on pro¬ 
cessors that don’t support VME, (Intel 
source code for CPUID detection.) 

• “The VIP flag together with the VIF en¬ 
able each applications program in a mul¬ 
titasking environment to have virtual¬ 
ized versions of the system’s IF hag." 
(Section 10.1.1.) 

• 'The VIF is a virtual image of IF (the in¬ 
terrupt flag) used with VTP/’ (Section 
10 . 1 . 1 *) 

• CU, STI, PUSHF, POPF, INT-n, and 1RET 
are all affected by enabling VME. (In¬ 
struction description for CL1, STI, PUSHF, 
POPF, INT-n and IRET.) 

• IRETD or POPFD never modifies VIF 
and VIP, (Instruction description for IRET 
and POPF.) 

• The VIF bit “operates between a STI 
slate for enabling die execution of in¬ 
terrupt instructions initiated by [a] pre¬ 
viously written program and a CLI state 
for disabling the execution of [the] in¬ 
terrupt instructions." (UK Patent Appli¬ 
cation GB 2 259 794 A t Claim La.i.) 

• The VIP bit “operates between a pend¬ 
ing state during which interrupt requests 
are awaiting execution and a nonpend¬ 
ing state in which no such interrupt re¬ 
quests are awaiting execution/* (UK Patent 
Application GB 2 259 794 A, Claim l.a,ii,) 

• The Pentium contains the means to let 
die operating system “(change) die state 
of the VIF bit of the EFLAGS register 
withouL die use of emulation software, 
so long as the VIP bit is in its non- 
pending state." In other words, when us¬ 
ing Ev86 mode, CLI and STI modify VTF, 
without blocking or enabling external 
interrupts, and without modifying the 
actual interrupt flag (IF). (UK Patent Ap¬ 
plication GB 2 259 794 A f Claim Lb.) 

• Once VIP is set into EFLAGS while VIF 
is in the nonimermptible state, die Pen¬ 
tium will respond to an STI instruction, 
by “automatically executing the STI in¬ 
struction and awaiting an interrupt re¬ 
quest by means of the emulation software 
without first changing the state of the VIF 


bit in EFLAGS from a Noninterruptible 
state to an Interruptible state/’ (UK Patent 
Application GB 2 259 794 A, Claim Lc) 

As Eve already mentioned, this infor¬ 
mation was publicly available at die time 
Intel was trying to keep it secret. Anybody 
widi the necessary skills could use this in¬ 
formation to reverse engineer the re¬ 
maining details of Intel’s VME implemen¬ 
tation. In the long run, keeping this 
information secret hurt the engineering 
community more tiian it helped Intel main¬ 
tain any competitive advantage. 

VME Details Finally Released 

When the Pentium Pro manuals were fi¬ 
nally released in April 1996, Intel released 
its own description of VME. By this time, 
my article “Pentium’s Virtual Mode Ex¬ 
tensions Revealed" (coauthored with Jim 
Brooks, EE Times, November 11, 1995) 
describing VME had been publicly avail¬ 
able for five months* Upon reading In¬ 
tel's VME description, I noticed that some 
of it is fatally flawed. Consider the fol¬ 
lowing excerpt from the Pentium Pro 
Family Developers Manual, Volume 3, 
section 12*3*5: 

If the processor receives an interrupt or ex¬ 
ception and die VIF flag is dear (maskable 
hardware interrupts enabled), the proces¬ 
sor performs the same operation as it does 
for method 5 interrupt and exception han¬ 
dling (that is, it redirects handling to die 
8080 program's interrupt and exception 
handlers). The processor also handles in¬ 
terrupts and exceptions in this manner if 
the VIF flag is set, and the processor re¬ 
ceives either an NMT interrupt or an ex¬ 
ception (interrupt vectors 0 through 18). 

If the processor receives a maskable l lard- 
ware interrupt (interrupt vector 32 through 
255) when the VIF flag is set, processor 
performs and the interrupt handler software 
must perform the following operations: 

In some way or another, almost every sen¬ 
tence in this quoted text needs correction. 
Hie passage should read as follows: 

If die processor receives an INT-w instruc¬ 
tion, the processor performs the same op¬ 
eration as it does for method 5 interrupt 
and exception handling (that is, it redirects 
handling to the 8086 program's interrupt 
and exception handlers). The processor 
never handles maskable hardware inter¬ 
rupts, exceptions, or an NMI interrupts in 
this manner. 

If the processor receives a maskable hard¬ 
ware interrupt, exception, or an NMI in¬ 
terrupt when the VTF flag is set, the pro¬ 
cessor causes a #GP(0), Otherwise, when 
the VIF Hag is dear die processor and in¬ 
terrupt handler software must perform the 
following operations: 

I was so confused by Intel's docu¬ 
mentation that I’m still not sure what it 


was trying to say. Therefore I made my 
best guess, while making the technical 
corrections along the way. To determine 
if Intel has corrected these errors, I 
checked this same text in the newly re¬ 
leased Intel Architecture Software Devel¬ 
oper T s Man uat, Volu me 3, discovering that 
Intel has completely removed this entire 
section of text, and the detailed expla¬ 
nation that followed. 

The following text was removed from 
Intel’s manuals. The text is the continua¬ 
tion of the passage just listed. The text 
describes the steps which the micropro¬ 
cessor and Ev86 monitor must perform 
when a maskable hardware interrupt, ex¬ 
ception, or an NMI interrupt occurs when 
EFLAGS/VTF=0. 

I The processor makes a call to a protected 
mode interrupt handler as described tn 
the following steps. These steps are al¬ 
most identical to those described for 
methtxJ I interrupt and exception han¬ 
dling in “Handling a Virtu a1-8086 Mode 
Interrupt or Exception Through a Pro- 
tected-Mode Trap or Interrupt Gate" on 
page 12-17: 

a. Switches to 32-hil protected mode and 
privilege level 0. 

b. Saves the state of the processor on the 
privilege-level 0 stack. The states of 
the EIP, CS, EFLAGS, ESP, SS, ES, DS, 
FS, and GS registers are saved (see 
Figure 12-5 on page 12-18). In the 
EFLAGS image on die stack, the IOPL 
field is set to 3 and the VIF Hag is 
copied to the IF flag. 

c. Clears the segment registers, 

d. Clears the VM dag in die EFLAGS reg¬ 
ister. 

e. Begins executing selected the pro- 
tected-mode interrupt liandler. 

2. The recommended action of the pro- 
tected-mode interrupt handler is to read 
the VM flag from the EFLAGS image on 
the stack. If this flag is set, the handler 
makes a call to the virtual-8086 monitor. 

3. The virtual-8086 monitor reads the VIF 
Hag in the EFLAGS register* If the flag 
is set, the virtual-8086 monitor sets the 
VIP flag in the EFLAGS register to in¬ 
dicate that there is an interrupt pend¬ 
ing and returns to the protected mode 
handler. 

4. The protected mode handler executes a 
return to virtual-8086 mode. 

5. Upon reluming to virtual-8086 mode, the 
processor continues execution of the 
8086 program without handling the in¬ 
terrupt. 

When the 8086 program executes the STI 

instruction to clear the VIF flag, die pro¬ 
cessor does the following: 

1* Checks the VTP flag. 

a. If the VIF flag is dear, the processor 
clears the VEF flag. 

b. If Lite VIP flag is set, the processor gen¬ 
erates a general-protection exception 
(#GP), 
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2. The recommended action of the 
protected-mode genera l- protection ex¬ 
ception handler is to then call rhe virtual- 
8066 monitor and let it handle the pend¬ 
ing interrupt, 

A typical action of the virtu a1-8086 mon¬ 
itor is to clear the VTF and VTF flags in the 
EFLAGS image on the stack and execute a 
return to tile virtual-8086 mode (through 
the protected-mode exception handler). 
The next time the processor receives a 
maskable hardware interrupt, (providing 
the V1F flag is still dear) it will handle it in 
the same manner as with method 5 inter¬ 
rupt and exception handling. 

Note that the states of the VTF and VTP 
flags Lire not modified in real-address mode 
or during transitions between real-address 
and protected modes. 

More Information: Missing in Action 

Intel has never released the algorithms 
used by the various CPU instructions need¬ 
ed to support VME, Perusal of Intel's Pen¬ 
tium manuals shows that GU t STI, PUSHF, 
POPF t INT-n, and 1KET have all lieen mod¬ 
ified lo support VME. This support is nec¬ 
essary, as all of these instructions have the 
ability Lo modify the Interrupt Flag in the 
EFLAGS register (EFLAGSJF). With VME 
enabled, the processor must now act on 
the Virtual Interrupt Flag (EFLAGS.VIF), 
instead of Lhe real IF flag. 


To the best of my ability, I’ve made List¬ 
ing One (listing begins on page 138) rep¬ 
resentative of the algorithms Intel uses in 
its Pentium VME implementation. I’ve only 
included the algorithms for CLI, STI, 
PUSHF, POPF, and IRET. I never attempt¬ 
ed to reverse engineer the INToz instruc¬ 
tion. The INT-ra instruction should close¬ 
ly resemble the PL^SIIE instruction, with 
tlie additional support needed for the In¬ 
terrupt Redirection Bitmap, The code is 
written in pseudo-C format, but is intended 
to be human readable. 

VME Caveats (When CR4.VME=1) 

VME is wonderful for reducing the com¬ 
plexity of the operating system, but has 
nlimy caveats. Some of the quirks are nat¬ 
ural extensions of VME or die underlying 
architecture, while others make little 
sense. 

* While an Ev86 task is running, the IF- 
sensitive flags commit fakery to fool 
the Ev86 task into thinking that it us ac¬ 
tually setting and clearing Lhe IF flag. 
When a fault occurs to the monitor, the 
monitor sees the actual values of V1F, 
VIP, and IF untiltcred. This behavior 
gives the monitor direct control over 
virtual interrupts, and hardware inter¬ 
rupts. 


* Software-generated exceptions are not 
influenced by die IR bitmap. Instead, 
these opcodes will always invoke the 
protected mode exception handler. 
Software-gene rated exceptions are as 
follows: 

1. INTI, also known as ICEBP, (An un¬ 
documented opcode, OxFl, which 
generates an INT-1 exception.) 

2. LNT3 (opcode OxCC), 

3. INTO (opcode OxCEX 

4. BOUND (opcode 0x62). 

* All processor exceptions invoke die pro 
tected mode exception handler. The LR 
bitmap never influences processor ex¬ 
ceptions. 

• Maskable hardware interrupts and NMI 
interrupts invoke the protected mode 
exception handler when EFLAGS. VIF^O, 
When EFLAGS.VIF-1, maskable hard¬ 
ware interrupts and NMI interrupts cause 
a #CP. The IR bit map never influences 
these interrupts. 

• VTF will never transition from 0 to 1 
while a virtual interrupt is pending when 
IOPL<3. Tlus condition will always cause 
a general-protection fault (#GP(0)) be¬ 
fore V1F is set. This can best be de¬ 
scried as follows: While the Ev86 task 
is in an uninterruptible state (VTF=QX a 
timer tick occurs, and causes a transb 
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tion to ilie protected mode interrupt 
handler. The timer-tick routine sets VIP 
on die stack frame, and returns to the 
Ev86 task. The Ev86 task performs an 
STT instruction, which will put the pro¬ 
gram into an interruptible state. Before 
VTF is actually set, the #GP occurs, (This 
behavior can be observed by inspect¬ 
ing the EFLAGS image on the CPL-0 
stack frame.) Tills behavioral character¬ 
istic is die basis for Intel's British Patent 
application, which claims “if the VTP bit 
is in its ‘pending' state, then the STT in¬ 
struction can lie executed and an await¬ 
ing interrupt is serviced by using the 
emulation software without first chang¬ 
ing the VTF bit from its CL1 state to its 
ST1 state." 

• VIP functions in IOPL=3 in a strange 
manner. When IOPL<3, and VIP=L any 
transition from VTF=0 to VIF=1 will 
cause a general-protection fault before 
VIF is actually set. When IOPL=3, this 
behavior is different. VIF must actually 
be set before the #GP occurs. 

• PUSHFD, POPFD, and IRETD will all 
generate a general-protection fault in an 
Ev86 task when IGPL<3. 

• The behavior of POPF and IRET is in¬ 
consistent in how it handles the trap 
flag (TF) from the FLAGS image on the 
stack. When POPF is invoked widi TF=1 


on die stack image, a general-protec- 
don fault occurs. When IRET is invoked 
under the same condition, the gener¬ 
al-protection fault does not occur. It 
has been recently reported that new¬ 
er Pentiums do not demonstrate this 
behavior. If true, then Intel has uncer¬ 
emoniously fixed this behavior, and 
failed to mention this fact in their er¬ 
rata documents. 

* IRET is no longer IOFL- sensitive when 
CR4.VME=1. Consider that the Ev86 task 
is running and an interrupt's associated 
IR bit is set. When this interrupt is in¬ 
voked, it will cause a fault to the mon¬ 
itor, or the protected inode interrupt will 
be invoked (depending on IOPLL The 
interrupt is then reflected back to the 
Ev86 task. When IOPL=3, the behavior 
of the fault and subsequent IRET match¬ 
es non-Ev86 behavior. When 1GPL<3, 
the subsequent IRET from the interrupt 
service routine doesn't cause a fault back 
to the monitor, as IRET is no longer 
IOPL-sensitive. This behavior is incon¬ 
sistent with its non-Ev86 counterpart. 

* When IOPL=3* die CPU never updates 
VTF, though software can manipulate it 
in CPL-0. This can lead to the next 
caveat, r . 

* The CPU will generate a genera 1- 
protedlon fault (#GP(0)) whenever VTF=T, 


and VIP=1 and running in CPL-3. This is 
regardless of whether or not the code 
is an Ev86 task. 

• Tn CPL-0, IRFTD has the ability to set 
VTF and VIP, but POPFD does not. 

Conclusion 

I have described everything I know about 
VME in this series, but I’m sure that even 
so, not everything is known and docu¬ 
mented. For example, since I wrote my 
last VMF column, J discovered a few 
more caveats. As time goes on, I’m sure 
more VME details will be discovered. Yet 
in these three articles, Tve presented 
many Limes more information than is con¬ 
tained in Intel's manuals. In the release 
of its newest architecture manual, Intel 
has demonstrated that it wishes to say 
less about VME, thereby removing vital 
information from the engineers who need 
it most. Therefore, consider these articles 
a blessing, and save the source code con¬ 
tained within. You never know when 
you 11 need it—and you'll never know 
when Intel will release any more VME 
details. 


DDJ 

(Listing begins on page 138.) 


In too many software development projects, It’s every' developer for 
himself. Which is why more teams are turning to PVGS Professional 
from INTERSOLV. Comprehensive, open, scalable, PVGS Professional 
brings order to the entire development process. It fully automates the 
management and communication of changes, even across platforms and 
the Internet. This means teams will always be in sync, from any location 

The only aspect of enterprise-wide software 
development that happens automatically. 

enterprise-wide. With everybody pulling in the same direction for a 
change, teams arc free to create more innovative software, fasten 
Democracy triumphs again I For complete SCM solutions for your 
development teams (including die just released Version Manager 6.0), 
call 800-547-7827 or any authorized INTERSOLV reseller. 

Or visit wwwjntersolv.com/pvcs today. 



vt r 

t ' , ‘1% f 

PVCS VERSION MANAGER 

PVCS TRACKER" 

PVCS CONFIGURATION BUILDER 

. pvos T 

The worldwide standard for organizing 

Captures, manages and communicates 

Automates and accelerates error-free 

PROFESSTtmAL j 

managing and protecting your enterprise 

issues throughout your entire enterprise. 

software huitds across platforms. 

- „ V L 

software assets. 


J 


AD LINK 227 


Dr. Dobb’s Journal, May 1998 


133 












* 


DR. ECCO'S OMHIHEURIST CORNER 

^ m— 


Nimmerics 



Dennis E. Shasha 


I was just figuring out how 1 could force 
a draw in our chess game, when the 
doorbell rang. It rang again almost im¬ 
mediately. Then again. 

"Must be the Feds,” Ecco said with a 
sigh. “No patience at all” 

Ecct/s 10-year-old niece Liane, who 
had been watching our game with an 
ironic smirk, said "Ill get it,” and ran to¬ 
wards the door. 

Federal agent Thomas flashed his ID 
badge to all three of us, took a seat with¬ 
out being invited, opened his briefcase, 
pulled out a folder, and began to speak, 
“We were keeping Wun derma mi un¬ 
der observation. Our agents saw him yes¬ 
terday morning. By the afternoon, he 
was gone. All he IcfL was tills note. The 
Lrouble is that we don 1 ! know what to 
make of it,” 

Wim Wundermann was a profession¬ 
al colleague of Ecco s. Fie had dropped 
out of sight a few months earlier, saying 
that he had discovered patterns he called 
"Nimmerics." Ecco told him that he 
should steer dear of applications to en¬ 
cryption. 

"Well what does the note say?” Liane 
asked. 


Dennis, a professor of computer science at 
New York University, is the author 0 / The 
Puzzling Adventures of Dr. Ecco (Dover, 
1998), Codes, Puzzles, and Conspiracy 
(W,H< Freeman & Co. t 19921 Database 
Tuning: A Principled Approach (Prentice 
Halt\ 1992% and (coauthored with Cathy 
Lazere) Out of Their Minds: The Lives and 
Discoveries of 15 Great Computer Scien¬ 
tists (Springer Verlag T 1997), He can be 
contacted at DrEcaMddj. com . 


Agent Thomas showed us the paper, 
and we all noticed that the writing was 
strange — especially at the beginning of 
the note. 

This is useful information for any Nim- 

Wise player. You know how to Play this 

Easy game of Nim Though it is not as easy 

as it seems. 

"Oh, yes, 1 remember," Liane inter¬ 
rupted, 

"You start with a bunch of stones and 
you can take one, two, or three of them 
in your turn. You win if you remove the 
last stone.” 

"Right,” said Ecco, "and do you re¬ 
member the winning strategy?” 

Liane answered, "You try to end each 
turn leaving zero, four, eight, 12, 16,... 
(any multiple of four) stones to the oth¬ 
er player. Then you force the number of 
stones to exactly four less at the end of 
your next turn.” 

“What would happen, Liane,” Ecco 
asked, "if each player could remove one 
more stone than the last player? So, if 
the First player can remove up to three 
in the first move, then the second can 
remove up to four in the second move, 
and the First can remove up to five in 
the third move, then the second can re¬ 
move up to six in the fourth move, and 
so on?” 

“Haimm,” Liane paused for a moment. 
“The second player would still win if 
there were four stones at the beginning, 
Fait would also still win if there w ere live, 
since he'd be able to take up to four m 
the second move. Ill have to think about 
this some more/ 


"May I continue?” Federal agent Thomas 
demanded. “We are talking alx>ut the dis¬ 
appearance of a major national asset here 
and...” 

“Yes, of course,” Ecco apologized 
(though l thought 1 detected a trace of 
mockery in his voice). ’Please go on," 
Thomas read on, but in a louder, slow¬ 
er voice. 

I will give you two puzzles. Suppose I 
present a bowl containing one $1 hill 
two $2 hills, three !n Dills, and four S10 
bills. Play consists of moving a single 
bill at a time from the howl to the table, 
which has no bills iniUally, until the 
amount on the table reaches or exceeds 
a given target number. If a player s move 
makes die amount reach that target, ihen 
that player wins. If a player s move 
makes die amount exceed that target, 
however, then that player loses. 

For example, if the target number is 
three, then tile second player wins, be¬ 
cause the first player will either exceed 
three dollars on his first move or will 
place a $1 or $2 hill on the table, in 
which case the second player will make 
the amount equal three dollars on her 
next move. 

“Yes, but the first player usually has 
an advantage,” Liane observed. 

Thomas glared at her, but Ecco looked 
at her curiously. 

“Oh, it just feels that way," she said 
with a shrug. 

“It s true for Nim,” she added. 

Ecco nodded, but then turned his eyes 
to agent Thomas. 

Thomas read on with even more de¬ 
liberation. 
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Let X he the first target number 29 or 
greater in which the second player can 
force a win. Let Y be the first target 
number 41 or greater in which the first 
player can force a win. Find me at that 
location. 

Can you find Wundermann *s address 
at X and Y? Does either X or Y change if 
there are twice as many $ Is, $2s, $5s t 
and $10s? If so } to what? 

Ecco nodded his 
head thoughtfully, 
then said “Okay, so 
let s play Wunder¬ 
man ns game, Miss 
Feels-That-WayT 
While Thomas 
paced the room, 

Ecco worked out 
the problem by 
playing the game 
with Liane, After he 
gained enough in¬ 
tuition , he gave 
Thomas the answer. 

"These two num¬ 
bers are the X and 
Y you are to look 
for, perhaps they 
are street and av¬ 
enue addresses. 

Liane has a theory 
about the letters,” 

“NWFENT," she said. Took at the be¬ 
ginning of the note. Those are the weird 
uppercase letters. The others are there 
by grammar." 

Federal agent Thomas stared at her as 
if puzzled for a moment, then his face 
cleared. He collected his papers, stood 
up, and left without a word, 

A few days later, Thomas was back. 
"We found the place. A penthouse apart¬ 


ment in the Northwest district near the 
street comer you identified," Thomas said, 
nodding to Liane and Ecco, "Now, Wun- 
dermann has left a new puzzle. The guy's 
insane,” 

Consider a variation of Ntm which I II call 

Expanding Nina* In that variation,., 

Thomas paused, then said, "This new 
game is exactly the one Ecco asked Liane 
about the other day. ft is the standard 
Nim except that, for 
the first move, the 
first player can take 
one, two, or three 
stones; in the sec¬ 
ond move, the sec¬ 
ond player can take 
one, two, three, or 
four stones; in the 
third move, the first 
player can take one, 
two three, four, or 
five stones; in the 
fourth move, the 
second player can 
take one, two, 
three, four, five, or 
six stones, and so 
on. The possibilities 
expand at each 
move, making this a 
pretty difficult game 
to play,” 

“Well, Liane, can you do it?” Federal 
agent Thomas asked. 

"Sure,” she answered cheerfully. "Here 
they are... 1 ® 

Can you find Dane's solution as well as 
the solution to the question posed earlier 
in italics? If so, let me kno w at DrEcco® 
ddj.com . 

DDJ 


The second player 
would still win if there 
were four stones at the 
beginning, but would 
also still win if there 
were five 


Last Month's 

Solutions 

Eccp believes in symmetry* so ills solutions were to have the spacecraft land 

at the following coordinates: 


For 7 spacecraft: 

For 6 spacecraft: 

(250 250.0 

( 250 250.0 

250 500.0 

250 500.0 

250 750.0 

250 750.0 

500 500.0 

750 250.0 

750 250.0 

750 500.0 

750 500.0 

750 750.0) 

750 750.01 


1 don’t think these are the possible solutions, however. See if you can do 

better. The generalization to any number pf points is an open problem so far. 
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1000*3 of High Tech Jobs in 
your area @ YOUR 
address for SUCCESS: 


to wgyks 

The Computer Job Center 

Job Seekers 

Job SubmltfUpfate 

Smirch a Resume 

Itra^Sb PewYT ClCh 
morthei TOUR 





Submit o resume on-tine 


Employers, 


ln[w*prks 
k Information 


Member 

Entrance 


Take 
a Test 


Coni at t 
Us 


www.IT123.com 

The only rogion^ify-specific technical career site on 
the iveti at no cost to youf" 

Fax: 1-303^663-8145 E mutt; info^lnfaworksusa.CDm 


Sexy 


THE SHAY GROUP QE 


Visual C+ + Delphi PowerBuilder Smalltalk Orach! 


Job lost its sex appeal? 

Bored? Need a new challenge? 

Want more dough? 

Ready to tell your manager @#$I 
Do you really need the headache? 

Let us help. At Peak Career Mgmt., 
we specialize in recruiting software 
developers (C/C++, Windows, UNIX, 
Device Drivers) end other sof tware 
industry professionals across the US. 

Our services are free to applicants. 
Positions available nationwide. 

Contact us today at 
dobbs@peakweb,com 
(303)316-0600, FAX (303)316-0700 


www. peakcareer. com/peakcareer/ 



S/W ENGINEERING POSITIONS NATIONWIDE 

We Understand 
The Programmer’s 
MindT 

When the country s 
top firms look for the 
best developers 
available, they turn 
to Bateman. Why? 
Because we spe¬ 
cialize In MS Win¬ 
dows. NT. JAVA and 
Macintosh recruiting 
nationwide So if it's 
time for a career 
move, give us a call. 

We understand your 
skills, and the mar¬ 
ketplace for them.., 
we understand you. 

# Bateman Inc. 

711 Daily Dr.. Suite 106 
Camarillo, CA 93010 
Tel: 805-383-3330 Fax: BOB 383 3337 


f RSVP SERVICES ^ 

Employment Service 

trusted by computer professionals 
and employers since 1966 

Clients and/or affiliates in all regions 
of U.S. and Canada * All platforms and 
technologies ■ Employment fees and 
relocation assistance paid by 
employing organizations 

APPLY ONLINE: 

www.rsvpjobs.com 

or call (toll free), mail, fax or email 
resume: Howard Levin, Dept DDJ, 

RSVP Services, PO Box 8369, Cherry 
Hill NJ 08002-0369 
Phone: 800/222-0153 
Fax: 609/667-2606 
Email: hl@rsvpjobs.com 


ytrr 


♦ . . Gui, GUI, GONE 

If you’re a software developer 
skilled in designing graphical user 
interfaces with: 

• Visual C++, 

»Powerbuilder, 

* Delphi, Smalltalk, or 

• Oracle Developer 2000 
... and you're ready to make a 
change in your employment 
picture- double click here-your 
window of opportunity. 



THE SHAY GROUP 
voice: 800/894-2888 
fax: 413/256-8582 \ 

75114 2251 @compuserve com 
http ://www. shay snet. com/- shay gl 



Great Jobs 
in Lake Tahoe! 

Zephyr Associates develops I 
the leading investment! 
analysis software for the institutional! 
investment market Located in one of the most I 
beautiful places in the world, we are minutes | 
from skiing, hiking, and boating. 

We are looking for experienced Senior C++1 
application developers to join team embarking I 
on next generation of product. Proficiency I 
developing Windows applications using MFC I 
is required. Experience in object oriented [ 
design and development desired. 

Wc offer an attractive: salary, benefits, and an 
outstanding retirement plan. 

Zephyr Associates, Inc. 

E-mail: aaron@styleadvisor.com 
w wwjty lead visorcom/j obs.htmi 


Software Careers Ad Links 

Ameritech ... Ad Link 700 

Anderson Consulting . ....Ad Link 7011 

IBASE Consulting. .. Ad Link 702 

Bateman, Inc ..... ...Ad Link 703 

The Computer Jobs Store . Ad Link 704 

D & L Online ..... Ad Link 705 

Global Resource Management.,Ad Link 706 
Hamilton Technical Personnel,...Ad Link 707 

Infoworks USA .. ...Ad Link 70S 

Micron Communications, Inc, ,..Ad Link 706 

NuMega Technologies ... Ad Link71C 

Peak Career Management . Ad Link 7111 

Qualcomm .... Ad Link 71 % 

Riccione & Associates, Inc....Ad Link 71 : 

RSVP Services . ..Ad Link 714| 

Scientific Placement, Inc . Ad Link 71 bl 

The Shay Group . Ad Link 71 el 

Spectrum Concepts Consulting. Ad Link 7171 

User Technology Assoc., Inc . Ad Link 71 bl 

Zephyr Associates . .Ad Link 7191 



































































DDX/DDV 


Listing One 

//(tm DATA(EGCircleJ 
etUM [ IDO = inO.ClUCLIt ] i 

CEtting m . Px; 

CString m_Py: 

CString in.Radius; 
//J]AFX_DATA 


Listing Two 

//[ [AFX.DATA(EGCirule) 
enum [ IQD = IDD_CIRCLE )j 
CMflcro di Fx; 

CMaprp m_Py; 

CMacto m. Radius]; 

// 3 ] AFE. DATA 

Listing Three 

void ECCirclBiiSerislistefCArcliiT'&Sr ar) 
t 

if iar,r&StorinfiO) 
t // storing cade 
ar « b..Px 
<< n Py 
<< W-RadiuB; 

J 

else 

[ // loading Cade 

ar m„Px 
>> ta_Py 
S5 m_RadiuB; 

3 

) 

Listing Four 

CMFEleBi ECCirclei :operator^(const CKFElemi elRHE) 

t 

CMFElem: :operator={eIRR3); // Call base class assignment 1st. 

if (this = &elRH$) //Nothing to do I 

return *tMs; 

n_Fx - ({ECCirclefi} elRHE).m.Px: 

a.Py » ((RGCixcW alRHSj.B.Py; 

■-'Radius = (CEGCtrcleS) elRHE), ■_Radiua: 
return *thls: 


C PROGRAMMING 


Listing One 

// - plsymidi.cpp 

finclude tfstrcami 

m include <iostreaa> 

Sinclude "aidifilo.ir 


clast WT*110v»rtnr« : public MTOFFile C 
void StartTreck(Lot trackno); 
static ini iB_nNotts[I [2]: 
public: 

WsrfT«nOverture{std: Eofetream& rfile) : 

BlDiriletrfile. 1* 3. 120) I ) 

U 

Static const int iv a 100; // note interval 

static const int nt = 60t // lat note of song 

iut WmTallOverture: :m_hNotflS[] [2] “ [ // {note, delta] 

[nt.0},Lnt. iy/23,{nt.iy/2|.(nt/iw).[nt.iv/2].[nt.iv/Zh 
[nt,iv]. [nt, iv/2].[nt+5.iv/2 3, lnt+7.iv].Cnt+9,ivJJnt,ivl, 

[nt,iv/2 J,fnt,iv/2],{nt t iv],{nt„iv/2].(nt *5,iv/ 2 ).[nt+5,ivj, 

[nt+9 ,iv/2],tmt+?,Iv/2),[nt+4.iv].[nt.iv],{nt.iv],[nt. iv/2], 

(nt,iv/2),Cnt,ivj d {nt, iv/2] H [nt,iv/2),(nt,iv),{nt,lv/2]„ 

[nt+5.iv/2].tnt+7,iv].lnt+9 ,£vJ,Ent+5.ivl,tnt+9.iv/2]. 

(nt+12,iv/2j ,(nt+10.iv+l],fnt+9,iv/2],{nt+7,iv/2], 

[nt+5, iv/2]. Lcit+9. Iv]. [nt+5 r ivl. 

[0, iv] 

}f 

void WmTEllOverture: :StartTrack(int trackno) 

t 

avitcb (trackno) { 
case 1: 

// tempo Irsck 

TexrEvent[0. HETA_SEQ7RKNAME, 21. *f£liian Tell OverTure"); 
Tempo(B. 2SWM3; 
break: 
case 2: 

fi -piano track 

Te*tRvent[0. NETA_SEQTRKNAJiE. 5, "Piano"): 

ProgramChange(0, 0,0): // channel 0 = annuatie piano 

// — play the notes 
int i: 

far (i = 0; m_nHot&«[II[0] 1= 0; i++) { 

if (i > 0] 

WateOff(m_nHotes[i][ij r 0.n.nNot^f[i~l][Hj.0)3 
Motedn,0 P m_nWoteE[il[0j.64); 

J 

NotEOff {m_nMoteS[ij [lj ,0,m. n'Satesti-ll [0] ,0] ! 
break: 


case 3: 

t 

// —— drum track 
conat int shot - 42; 
eonat int crash = 49: 

TeictEvent [0, f*ETA. SE.QTE3tHAHE, 5, ’'Druraa") 3 
ProarainChange({J. 9, 0): 
for (i = 0 r n.. nJfotea [il [0] t = 9; i++) 
HoteOn(m.nMotea[i][1].9.shot,fi4}; 
HoteGn [ iv, 9, trash, 1M): 
break: 


default; 
break: 

1 

] 

int mainO 

[ 

a td:: of at i earn ifileCWTQ.mid". 
WkTel lOver turn wtoUfll#); 
irto. MriteHIDIF il a (]; 
raturn 0: 

T 


atd::ios: i bins ry): 


ALGORITHM ALLEY 


Listing One 

// singly Linked Liar Ouickaort 

private Node Quicksort{Node before. Itoda first, int n) [ 
int Nual=0. HLra2=n. 1=1: 

Node S’ivotvfirei, aNode*first. aNo4<tPrav"f£rBt; 

// Fivot Advancement 
for {1*1: l<r: £++. aNode^aNode-hexc) { 
if {aNode .compareTa (aNade, next) 5 S>'i 
break: 

if f(iSl)==0) l 

Pivot « Pivot r nextt 
Jfuml++T 

) 

3 

// ReeogjUaa sortedneaa in linear time 
if (i =- n) return firatf 

// Partition liat by unlinking nodes with values lees 
// than the pivot and pushing them onto front of list 
for (aNofePrev = aJtode: i C n; i++) { 
aNod e = uNodcP rev.next; 
if [Pivot.conpareTo{aHoda] >0) £ 
aNodePrev. next = aJfod e. nest; 
aHade.Tiext r* first: 
first ■ afLode: 

Nunl++: 

] 

elaa aNoieFrev = aNode; 

] 

if {baforeI=nuil) before.next = first; 

Nuini J ■ n - Nu»l - 1; 

// RECursE to sort subliats 

If {Nurnl J 1] first = Quicksort (bafore. first r Num.1): 
if (Num2 > 1) Quicksort(Pivot. Pivot.next. Wum2)! 
return flrat: 


Listing Two 

// Singly Linked List Recursive Kerge Sort 

private void mjrgesort(Node lief&re. Node FI. int Nl. Nod*Pair Np) { 
if (Ml <=* lj 

NP.first * NP r last = FI: 
else I 
Node F3: 
iut N2f 

N2 = N1; HI •»= 1: N2 -= NL; 
margaeort(before. FL. til. NP) t 
FI = NF.first: 

F2 * HP. Lfcflt.naxt: 

mergeaort(NF,last r 72, N2, NPI; 

¥1 * NP.first: 

merge(before. El, Nl. F2. N2, NPJ: 

> 

private void merge[Node before, Node FI, int Nl, 

Node F2 r inr JH. NodePair N?) L 
Mode firet=null, last=null, temp=null; 
int I, J: 

first - last = Pi.coapareTo(F2) <= 9 T Pi : F2; 
for £I=J=0; I c K| || J < N2: ) [ 
if [I t Nl bk (Jk-N2 !E F1.compareTo(E2) <= 0)) 

( temp = Ft: Fi = FI.next; 1*+; J 
else ( temp * F2| M = F2,next: J++; ] 

last .next - tM*p; 
laat = temp: 

] 

if (before = null) Pirat = first; 
alee before.next ■ first: 
laat.next = F2; 

NP,first * first: 

NP.last = last; 


Listing Three 

// Simpler Non-recursive Merge Sort 
// NOTE: Use same merge[} as in Listing Twn 
private void nr2..morgosort{) [ 
int i* j* k. Ml, H2; 

Node Pi. F2. bflforo: 

NadeFair NP = new NodePairC)r 

for (i*ti i < NumKodes: 

for (beforer=nuli, Nl=N2=i, j=0, r j +N1 i NunNodea; j+=i<tl) L 
FI = F2 = befare=null 7 Firat ; before .next: 
for [k=0; kfNl; k++) 

F2 = E2.next: 

if (N2 > MumNndes-j-Hi} 

N2 = NumNades-j-Ml: 
merge(beford. El, Nl. 12. »2, NPJ; 
before = MP.laat; 

3 

3 


Listing Four 

// Singly Linked List Binary Eesrch 
public Node binarySeareh(Object SearthKay) I 
Node PartitionFirst=FirEt, MidPrr=null: 
itti PatliliohSiac-NuniNodcs. Mid, j. K&gult; 

(continued on page 138) 


Dr. Dobb'sJournal May 1998 


137 














(co niinued from page 13 7) 

while (PartitionSize > 0} ( 

Mid =■ PftirtitionSii« / 2: 

MidFtr = PartitionFirBt; 
for {1*0: I < Mid-; I++) 

HidPtr = HidPtr.next; 

Result = MidFtr-coapareTofEearchKey): 
if (Result > 0) 

PartitionSize = Hid; 
else if (Result f 0) ( 

PartitionSize -= Hid; 

Parti ttanFirat ■ HidPtr: 

) 

else return. HidFtri 

} 

return null; 


UNDOCUMENTED CORNER 


Listing One 


GLI ( f* vSfi made */ 

if (EFLAGS.I0PL « i) [ 
then EFLAGS-IF = 0: 

else if (CR4.VKE =* 0) t /* IOPL < 3 */ 

then KGP(0); /■ ERROR CODE = 0 */ 

else EFLAGS-VIF - 0: 

J 

J 

i 

ST1 [ /* lode */ 

if (EFLAGS.IGPL = 3> C 
then EFLAGS,IF * U 

else if (CR4.VME == 9] ( /+ IOFL <3 ■/ 

<GP(B); h ERROR CODE ■ 0 ■/ 

else [ 

If (EFLAGS,VIE == \) l h it VIP alrendjpl. #GF{?) */ 

then ftGPfth /* ERROR CODE =0 •/ 

/■ Processor never pet VIF */ 

/* before flCPCJ */ 


else EFLAGS.VIF = 1; 

] 

1 

} 

> 

1 

PUSH? t /* v86 node *f 

if (EFLAGS■IOPL « 3) t 
then 

if (Operandsite == 32) t 

then push(EFLAGS A ftxFCFFFF); /* Clear VM A KF */ 


Wnas 


QflOWbound. Softw a 

L HIGH PERFORMANCE 



Imaging, Annotation & Plug-in 
Development Tools 

RasterMaster 7.0 Powerful, multi-platform raster 
imaging support for 60+ Raster Formats. Familiar API and 
well-proven technology in use worldwide by companies 
such as BP, Chase Manhattan, Ford, Gannett, HP, IMS1, Kodak, 
LEXfS-NEXIS, Polaroid, Toyo, Unisys, Xerox and more. 
Formats: TJF, JPEG, MOiDCA, CALS, PNG, DICOM, Rashpix, 
BMP, BRK, G3, G4, GIF, JED, KFX, MAC, tOCA, PCX, TGA, 
WFX and more. Features include: {* New for Version 7) 

* Compression ■ CMYK 4 Plane Support * 

* Image Processing "10-16 bit Gray Scale * 

* Despeckle & Deskew * Anti-Aliasing in Silver * 

- Multipage TWAIN Scanning * Fit to Width & Height * 

"Pan, Zoom, Scroll 

- image Editing 

* Auto Aspect Ratio 

- Alpha Channel Support 

RasterNote 2.0 Annotation/Redlining toolkit. Features 
include: Sticky Notes, Lines, Ellipses, Freehand Drawing, 
Polygons, Highlighting, Redaction, Multiple Annotation layers. 

HasterNet 2.0 Plug-in for Netscape and Internet Explorer. 
Read 60+ Raster Formats! Simple drop In library makes it easy to 
enable standard browsers to read your Intemet/lntranet pages. 

Platforms Include: Win 95WT, Win 3.x, Mac 68K/PPC, UNIX, Alpha, OS/2 
Languages: DLL, VBX. ActiveX/OCX, Delphi, FoxPro, PB, VC++ 

Call 617-630-9495 Or Visit Us At 

www.snowbnd.com Email Sak;sdd^srtcmrbnd,oom 

See our Website for FREE, HQ RISK EVALS! 

29 Crafts St, Suite N v 


* Tiled Image Support * 

■ Medical Imaging 

■ Animated GIF * 

* Image Encryption * & mom 
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: 32)) 1 /* IOPL < 3 
*/ 


else push(FLAGS): 

1 

/+ IOPL < 3 

ela« if ((CM.WE *= i: (OporandSize : 
then rtGPfr )3 /■ ERROR CODE = ( 

rise [ 

TEKF = FLAGS 3 

TEMP = TEHP OR 0*3000: I* Sex I0PL^3 on stack (dunfel) */ 
TEMP, IF = EFLAGS,VIF 3 
push(TEMP): 

1 
3 


) 


i 


POFF ( /* rode 

if (EFLAGS,IOPL == 3) ( /* IDPL = 3 

then if (Operands!** — 32) C 
then l 

TUMP * pcp{); 

/* Clear these fields from EFLAGS stack image: 

TEMP *■ TEMP AND NOT 0h1BB02A; /* VJP VIF VN RF IOPL 

{+ Keep these fields from current EFLAGS register: 
RPLAGS = EFLAGS AND 0x183002; /• VIP VIF VM RF IOPL 

EFLAGS = EFLAGS OE TEMPI 
J 

else { 

TEMP = popO; 

/*■ Clear these fields frsn EFLAGS Stack image: 

TEMP - TEMP AND MOT 0*B02A; /* IOPL 

/* Keep these fields from current EFLAGS register: 

FLAGS = FLAGS AND 0x3002; /* IOPL 

FLAGS = FLAGS OK TEMP: 


(DperandSize ==32)) [ /* IDPL t 3 
/•* If stack image TF=1, then. DGF 


1 

else if {(CR4.VME = 0) 
th«h IGF(IS); 
else if ([BP].TF) t 
than #CP[0) 
else if ((VIP = 1) LA fSP.rF]) ( 
than #GP(0): 
else ( 

TEMP ■ pop,tV* 

EFLAGS.VIF = TEHP.IF; 

/•* Clear these fields fTdm EFLAGS stack 
TEMP = AND MOT EiB22Af /+ IOPL, IF 
/* Keep theea fiolde from current EFLAGS reg 
FLAGS = FLAGS AMD 0*3202j/i IDPL, IF 
FLAGS * FLAGS OR TEMP: 

) 

3 

3 


*/ 

*/ 


*/ 

*/ 

•/ 

*/ 


i 


) 


1 


I RET ( /* VB6 Ujo4fe 

if (EFLAGS,IOPL ==31 /* IOPL = 3 

then if (OperandSize 32) ( 
then ( 

EIP = pupO ; 

CS = p»p(): 

TEMP = pepO; 

/+ Cieer these fields fro* EFLAGB stack image: 

TEMP * TEMP AND NOT 0X10M2A: /* VIP Vi? VH RF IOPL 

h Keep these fields fram current EFLAGS register: 
EFLAGS = EFLAGS AND 0X103M2: /* VIP VIF Vh RF TO PL 

EFLAGS = EFLAGS OR TEMPj 

1 

alee ( 

IF ■ popO 1 
CS = popO i 
TENF - pop!); 

/* Clear these fields from EFLAGS stack image: 

TEMP = TEMP AND EOT MxhMJA: /* IOPL 

h Keep these fields from current EFLAGS register: 
FLAGS = FLAGS AMD 0*3002; /* IDPL 

FLAGS = FLAGS DR TEKF; 

3 
3 


*/ 


*/ 

*f 

*/ 


*/ 

•/ 

•/ 

•/ 


aUa if ((GRA.VME = 0) M (OperandSize = 32S) l f* IOPL < 3 */ 

then HOP(0J; 

alee if ([SP].TF) ( l* If steek image TF=1, then ^GP ■/ 

then PGP(0 )3 

elaa if ((VIE = 1) S* [SP.IF]) ( 
than HGP(0)j 
else { 

LP - poKOr 
CS = pop()[ 

TEMP = pop{); 

EFLAGS,VIF = TEHP.IF; 

/* Clsar These field h from EFLAGS stack Image: +/ 

TEMP = AND MOT feiB22Ar /* IOPL, IF */ 

/*********** ***************************** ************/ 

/+ MOTE The fsllcrwing treatment of TF flag */ 
f* ********** +XAY RE A BDG» in the Pentium, */ 

h POFF usee the FLAGS mask of 0x3202, but 1RET */ 

/* uaes 0x3302. Conaider that FQPF and IEET */ 

f* HGP on a condition involving TF. So is it */ 

h juat eoincldenae that IRET has further if 

f* special treatment of TF on IRET, where POPE */ 

/* does not? if 

l *************»*********** ********* *****************xj 
f* Keep these fields from current EFLAGS reg */ 

FLAGS = FLAGS AND 0*33023/* SOFL, IF, TF */ 

FLAGS = FLAGS OS TEMP; 

1 
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W here the corporate 
world comes to shop. 




If you re a corporate professional looking for real solutions 
to your most challenging IT problems, PC EXPO is the one event that 
delivers all the goods. 

More than 800 IT suppliers will come to PC EXPO 
with one thing in common: corporate IT solutions — 
and to attend over 50 sessions within our Corporate 
Education Program. 

We deliver virtually every technolog)' for serious profit - 
along with the resources and information you need to leverage 
them. From die desktop to the enterprise, it's all here. Hardware. 



Software. Peripherals. Notebooks. Servers. Storage. Networking. 
Telecommunications, Mobile/wireless. Computer telephony. NT.Java. 

We mean all. Including an explosion of new Web and 
e-commerce solutions at our best-everWEB.X — The Internet 
Event for Business,held concurrently during PC EXPO. 

The world's hill of IT events. But for corporate 
professionals, there’s really only one, PC EXPO in New York. 

lo attend, call 800>829*3976 T ext. 2980 or register 
online atwww.pcexpo.com. 


EXPO 


Exhibitors! Reach one-half-ff'////0f> dollars in IT buying power! 


PC EXPO Exhibits * June 16-18,1998 * Jacob K.Javits Convention Center * NewYorkCity 
Corporate Education/Conference Program * June 15-18,1998 * Held at the Marriott Marquis 

in M ilJer Free ina n pc EXPO i n N ew York is a registered tredema rk of Wilier Freeman, Inc. • One Penn Plaza * N e^ York. NY 10119 

PCEXP0 and WEB .X are registered trademarks of M iEler Freeman, Inc. * One Penn Plaza -11th Poor * New York, NY 10119 ■ 800-829-3976 ext. 2980 ■ 212-714-1300 - Fax 212-643-4802 
PC EXPO is fartrade professionals only. No one under IS ad mined. Your badge and earner are non-ua referable and will be confiscated and/or terminated upon any attempt to transfer or sell them 
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TO BE NAMED MVP 
IN THIS BOWL, 

YOU DON'T HAVE TO BE 
GREAT AT PASSING, 
TACKLING OR RUSHING. 


YOU JUST HAVE 
TO BE A TOTAL GEEK. 


2D ZIFF-DAVIS 

presents 

iimimiiifinntH 

It’s The Tenth Annual Computer Bowl, where computer industry leaders from the east and ? 

west coasts battle for the title of "Super Sages of Cyberspace’.’ They compete to 
earn bragging rights, but more importantly, to benefit the education and preservation efforts | 
of The Computer Museum. For tickets, broadcast dates and complete information, 
or to just brush up on your trivia, check out our website at www.computerbowl.org. - 



Underwritten by : ^ Ba V Networks Created by, 

mil trt hta mi I Id n Flow wZ The Gmputcrliwdt* 

Official sponsors; Arnoud Communications 
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PROGRAMMER'S BOOKSHELF 


Fuzzy Logic 


Peter N. Roth 



C omputationally, it Is convenient to 
represent logical values with a sin¬ 
gle hit—False being represented by 
0, and True by 1. Fuzzy logic does 
not refute reality, but rather extends the 
convenient binary representation into the 
floating-point range [QT).,,1,0L Thus, it 
can be said that perception becomes pan 
of the computational milieu. “The sky is 
blue" 1 may be 95 percent true at 10 am, 
and live percent true at sunset. The para¬ 
dox point is 0.5; at that point, truth and 
falsity are equal, 

When designing a system to control a 
washing machine, for instance, you use 
perception to state what the system is to 
do. At the early stages of design, these 
statements take the following form: If the 
wash water is dirty, then add more de¬ 
tergent. This if-then statement is called 
a rule in a fuzzy system. It’s fuzzy, be¬ 
cause quantities are not mentioned (yet). 
Usually, several overlapping rules are 
used to model an entire system, so a sec¬ 
ond rule controlling the cleaning system 
might lie as follows: If the wash water 
is very dirty, then add a lot more deter¬ 
gent. The difficulty, and art, of control- 
system design is determining the mean¬ 
ing of “water is dirty," and determining 
how much detergent is “more detergent." 
One might think of a fuzzy cleaning sys¬ 
tem as one that takes a poll of water 
quality and detergent quantity experts, 
and acts on some combination of their 
advice. 

Because these are computerized sys¬ 
tems, the programs can revise their own 
rules. Thus, fuzzy systems can be said Lo 


Peter is president of Engineering Objects In¬ 
ternational producers of commercial and 
custom Delphi and C++ components. He can 
be contacted at htlp:/Amm). ineo n research 
. com/eov or petemtb@erols. com , 



Fuzzy 

Engineering 


Ban Kosko 
Prentice-Hall, 1997 
547 pp., $80.00 
ISBN 0-131-249914S 



Applications of 
Fuzzy Logic: 
Towards High 
Machine Intelligence 
Quotient Systems 

Mohammad Jatmhidi, Andri Tilth 
Lot ft Zadeb , Serge Boverie (editors) 
Prentice-Hall, 1997 
423 pp., $74.00 
ISBN 0-133-62831-0 


M Tfoe Design and 
Development of 
j Fuzzy Logic 

Byron Miller 

Impatiens Publications, 1997 
142 pp., $52,00 
ISBN 0-963-66370-4 


learn how to improve their own respons¬ 
es. The study of fuzzy logic Is an attempt 
to codify these ideas, and place he dis¬ 
cipline on a firm theoretical basis such 
that control systems can be built eco¬ 
nomically and confidently, 

I first heard of fuzzy logic in 1993. when 
a friend urged me to read the semipopu- 
lar treatment of the topic in Fuzzy Logic: 
The Revolutionary Computer Technology 
That ts Changing Our World, by Daniel 
McNeill and Paul Freiberger (Simon & 
Schuster. 1994). Given he pace of change 
these days, it seemed time to revisit he 
subject. 


Fuzzy Engineering 

Fuzzy Engineering, by Bart Kosko, is or¬ 
ganized into 15 chapters. Each chapter 
presents a single topic at what T would 
guess is a graduate level (this comes as 
no surprise, considering hat Kosko is a 
professor at the University of Southern 
California), The mathematics in all the 
chapters is challenging. 

A list of references follows each chap¬ 
ter. The overwhelming majority of he ref¬ 
erences date from 1985, and he most- 
recent reference in each chapter is (at 
most) two years old, so the material is cur¬ 
rent. In addition to Kosko, there are 11 
contributors to the text. While the book 
claims a single editorial control, there is 
some overlap and repetition of material. 
Each chapter repeats he mantra “a fuzzy 
system is a set of fuzzy if-the?i rules that 
maps inputs (the if parts) to outputs (the 
then pans)," 

Each chapter concludes with home¬ 
work problems, J would have appreciated 
a section that gave answers to, say, odd- 
numbered problems. That here is no such 
section suggests lhat Lhe book was not 
designed primarily for self study, but to 
be used in a Fuzzy-logic curriculum. 

Kosko's focus is the .Standard Additive 
Method (SAM). In his fuzzy-logic tech¬ 
nique, the then parts of the rules are sim¬ 
ply summed, rather than being combined 
with more elaborate techniques. Although 
this simplifies the computations, the SAM 
approach is not without its difficulties. Be¬ 
cause good approximations require many 
rules, the SAM can suffer from rule ex¬ 
plosion— that is, it may require more rules 
to approximate a good solution than can 
l>e computed in minimal time or with the 
computing facilities allocated to the job. 
Attacks on this problem are made in sev¬ 
eral ways: by altering the shape of the 
rules such chat they cover ellipsoidal patch- 
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es rather than rectangles; by training the 
systems to learn and adapt their solutions; 
by making some parts of the system 
“smarter”; and so on. Programmers will 
recognize these strategies as attempts to 
deal with the law of the conservation of 
hair, which is stated roughly: In a hairy 
computation, you can move the hair 
around, but the quantity of hair remains 
constant. 

Other chapters develop and apply 
fuzzy- logic theory to chaos, recursive par¬ 
titioning, signal processing, communica¬ 
tion, hardware, and cognitive maps. Cliap- 
ter 12, entitled “Fuzzy Cubes And Fuzzy 
Mutual Entropy,” is a tour de force. Giv¬ 
en that a single fuzzy variable can be 
mapped continuously over the [(0,0),(1,0)] 


line, then two fuzzy variables map to the 
unit square [(0,0),(1,0)X 1,1),(0,1)]; three 
map to the unit cube; and n variables map 
to die n- dimensional hypercube. The in¬ 
formation state within the fuzzy cube is 
condnuous, with the center paradox point 
lxring the fuzziest of the fuzzy, Kesko de¬ 
duces from die presence of die divergence 
operator in the information field equations 
that fuzzy computations can be thought 
of as information fluid calculations. 

Fuzzy Engineering is a product of at 
least 12 audiors. Ir is therefore appropri¬ 
ate to indicate such collaboration on the 
cover. Jn fairness to the other contribu¬ 
tors, Kesko should have lieen listed as die 
editor, rather than implied to be the sole 
author. 


Kosko is best suited to the fuzzy-logic 
practitioner, or to a graduate student fol¬ 
lowing the Kosko curriculum: This is not 
a starter book. The hook does include a 
3.5-inch diskette with software dial runs 
on DOS and Windows 3x. The programs 
are demos only; no source code is pro¬ 
vided The fuzzy cognitive map demo is 
only used)! alter thorough study of die 
text. 

Applications of Fuzzy Logic 

Applications of Fuzzy Logic , edited by 
Mohammad Jamshidi, Andri Titli, Lotfi 
Zadeh, and Serge Boverie, is a collection 
of 20 papers by an international panel of 
audiors, The spectrum is wide: The sys¬ 
tems to be measured and controlled include 
mobile robots, active autosuspension sys¬ 
tems, traffic noise, cement production, au¬ 
tomotive powertrains, locomotive slip, shut¬ 
tle training aircraft, and zero-sum games. 

Each paper has the familiar engineer¬ 
ing paper outline: 

* The problem is described, 

* The theory that will lie applied to solve 
the problem is described. 

* The particulars about the project are de¬ 
lineated. 

* The data are presented with graphs and 
text, 

* Conclusions are drawn, and recom¬ 
mendations made, for future work. 

Because die authors work independent¬ 
ly, and produce their papers to stand 
alone, there is some overlap in specifying 
the fundamentals of fuzzy logic. 

The first beef (of two) that I have with 
Applications of Fuzzy Logic is the New 
Registered Phrase, On the cover, it’s pre¬ 
sented as “High Machine Intelligence Quo¬ 
tient Systems®, “ In the “Preface,” the text 
is self-praised with the sentence “This is 
one of the first volumes in which MIQ® 
is brought to the attention of the scientif¬ 
ic community.” I was disappointed that 
the index had no entries to what was ap¬ 
parently an important idea. A careful 
search of the text revealed Lhe single men¬ 
tion of Machine Intelligence Quotient on 
page 103, without the ® mark. The text 
does not explain MIQ®, nor why it is im¬ 
portant. 

The second lieef is that the four editors 
who collected the papers and prepared 
them to lie published should have refined 
the material: You can find clinkers such 
as, “The essence of the developed in the 
work ideas allows as well come along oili¬ 
er way, that is to design original [system] 
for the new plants being constructed.” 
Practitioners are more likely to be inter¬ 
ested in the mathematics and the graphs 
than in pellucid syntax, so this is more of 
a cavil than a beef. 
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judging by die dates of the references, 
the papers presented here are state of the 
art. Practitioners in the control field will 
want to have a copy of this book on their 
shelf, but are likely to be interested only 
in the papers that apply to their discipline. 
Applications of Fuzzy Logic does not in¬ 
clude a diskette, and the papers contain 
little code. On the other hand, the audiors 
occasionally mention the languages in 
which their control programs are written: 
Fortran, C, Madab/Simulink, and Pascal 
Computers mentioned in the papers in¬ 
cluded a DEC PD P-11/44, 20-MHz Inmos 
T8GQ Transputer, 386 laptop, and IBM 
RISC6GG0. 1 can only conclude dial fuzzy- 
logic engineers are more interested in get¬ 
ting dieir jobs done, than in exploiting die 
hyped technology du jour. 

The Design and Development of 
Fuzzy Logic Controllers 

As I scanned Ihe Design and Deivlopment 
of Fuzzy Logic Controllers, by Byron Miller, 
several simple drawings illustrated what 
looked like interesting low-budget pro¬ 
jects. However, the “graphics 11 ate the bane 
of this book. 

On page 2, Figure 1.1 is supposed to 
show a dynamic system. Instead, it shows 
graphs of continuous and discrete data, 
and the graphs are mislabeled. 

On page 3, Figure 1.2 is a duplicate of 
Figure 1,1. Again, the continuous and dis¬ 
crete data labels are swapped. 

After an abbreviated and fairly clear 
survey of fuzzy logic, an input file for Lhe 
FuzzyStal system is listed in its entirety. 
It is followed by an unintelligible and un¬ 
explained “3D graph/ Several addition¬ 
al unintelligible input files are included, 
each accompanied by an equally opaque 
graphic. 

The software-development process is 
diagrammed with the hoary waterfall di¬ 
agram. Unfortunately, the diagram is said 
to demonstrate the spiral model of de¬ 
velopment. 

These, and other errors, make the text 
unreliable and not at all credible. 

Save yourself $52.00 and skip this one. 


Programmer's Bookshelf 

Submissions to Programmers Book¬ 
shelf and the Dr. Dohb's Electronic 
Review of Computer Books (http:// 
www.ddj.com/ercb O can l>e sent via 
e-mail to editors@ddj.com or mailed 
to DDJ, 411 Borel Ave,, Suite 100, 
San Mateo, CA 94402. 
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Aitrasoft lias released OO-Browser, a 
h igh- performance, mu kitangu age object- 
oriented code browser, OO-Brow.ser sup¬ 
ports C, C++, Lisp and CLOS, Eiffel, Java, 
Objeciive-C, Python, and Smalltalk. OO- 
Browser works under Windows, MS-DOS, 
and many UNIX variants. Complete source 
code for the browser is included for local 
adaptation or extension. Most operations 
that can be applied lo a single class or 
method can also be applied to a whole 
set at once. OG-Browser runs within the 
GNU Emacs or XEmacs editors, or as part 
of Altrasoft’s IDE, InfoDock, It can also 
call out to other editors such as vL An X 
Window graphical front-end is also pro¬ 
vided. Pricing starts at $599 00. 

Aitrasoft 

4880 Stevens Creek Boulevard, Suite 205 
San lose, CA 95129 
408-243-3324 
http://www.altrasoft.coni/ 

Baltimore Technologies has released Ver¬ 
sion 3-0 of J/CRYPTO, its Java cryptogra¬ 
phy library that includes ciphers such as 
RSA, DES* Triple-DES, and RC4. J/CRYP- 
TO 3-0 lets Java programs generate, pro¬ 
cess, and validate X.509 digital certificates, 
implements BASE64 encoding/decoding, 
and includes high-speed RSA performance 
for JDK 1.1 and higher. J/CRYPTQ works 
with most versions of Java Virtual Ma¬ 
chines, including 1,02 and LI,*. 
Baltimore Technologies 
International Financial Services Centre 
Custom House Quay 
Dublin 1 
Ireland 

+353 1 605 4399 

http://www .baltimore.ie/ 

Global Checker from Uniscape is a multi¬ 
byte enabling utility that helps indepen¬ 
dent software vendors reduce the time and 
expense of delivering multilingual prod¬ 
ucts to global markets. Global Checker 
first scans any C/C++ type source files, 


then identifies and reports any code that 
is noncompliant under National Language 
.Support (NTS) standards. Global Checker 
then suggests solutions through an ex¬ 
tensive online help system. Global Check¬ 
er is available immediately in three ver¬ 
sions ranging in price from $400.00 to 
$1500.00 per unit. 

Uniscape 

303 Twin Dolphin Drive, Suite 510 
Redwood Shores, CA 94065 
650-596-1430 

http://www. u nl- scape.com/ 

Mainsoft has announced MainWin XDE 
3.0, a Windows-on-UNIX platform port¬ 
ing technology. Main Win XDE 3*0 lets 
UNIX developers take advantage of Win¬ 
dows NT development standards—Win¬ 
dows 32-bit (Win32) API support, DCOM, 
and ActiveX—when porting to UNIX plat¬ 
forms. Past constraints concerning multi¬ 
ple data object model standards are re¬ 
solved through Mainsoft’s technology, 
allowing you to deploy enterprise appli¬ 
cations across heterogeneous environ¬ 
ments, MainWin XDE 30 is available on 
the SPARC Solaris 2.5.1 platform. Pricing 
starts at $12,000 per license. 

MainSoft 

1270 Oakmead Parkway, Suite 310 
Sunnyvale, CA 94086 
408-774-3400 

http://www. mainsoft, com/ 

Imperial Software Technology has re¬ 
leased Visa], a visual application builder 
for Java. VisaJ is w ritten in Java and runs 
on any platform supporting JDK Li, 
VisaJ provides a point-and-click inter¬ 
face for building Java applications using 
Java Bean components, and generates 
pure Java code. VisaJ includes an AWT 
editor, layout editors, a class builder, and 
some essential JavaBean components, 
VisaJ sells for $995.00 for the first license 
and includes one year of support and 
upgrades. 

Imperial Software Technology 
120 Hawthorne Avenue, Suite 101 
Palo Alto, CA 94301 
650-688-0200 
http://www.ist.co.uk/ 

TopSpeed has announced Clarion 4 Pro¬ 
fessional Edition, a programming envi¬ 
ronment that generates pretested code that 
is automatically created from tw'O-way 
templates and wizards. Its data-centric en¬ 
vironment includes a comprehensive set 
of objects tliat provide reusable services 
for accessing, processing, reporting, and 
displaying data. Clarion 4 includes Top- 
Speed's Database Accelerator, a database 
driver enhancement that incorporates 
caching to improve data delivery speeds. 


Via an alliance with Pervasive Software, 
Clarion 4 comes with Pervasive’s Scalable 
SQL ( two-user licenses) at no extra charge. 
Clarion 4 retails for $499.00. 

TopSpeed Corp, 

150 East Sample Road 
Pompano Beach, FL 33064 
954-78>4555 

http://www. topspeed. com/ 

Objective Toolkit/X 1.0 from Stingray Soft¬ 
ware provides the capabilities of Stingray’s 
Objective Toolkit PEG to Visual Basic pro¬ 
grammers as an ActiveX control. Using 
Objective Toolkit/X, you have access to a 
docking form and can dock or float any 
MDI child form in Visual Basic, Objective 
Toolkit/X also includes CollectionX, an ex- 
teasion to Visual Basic’s Collection object. 
Objective Toolkit/X L0 costs $395-00 per 
developer. Each copy of Objective Tool¬ 
kit/X includes 60 days of technical sup¬ 
port. Subscriptions, which include prod¬ 
uct updates and technical support for one 
year, are priced ai $195.00. 

Stingray Software 

9001 Aerial Center, Suite 110 

Morrisville, NC 27560 

919-4614)672 

http: //www . stingra y. com/ 

Global Technologies has announced its 
SDK for U/WIN, a UNIX-like environment 
for Windows-based systems with tech¬ 
nology developed at Bell Laboratories. 
11/WIN comes bundled with more than 
150 POSIX-compliant shell commands and 
utilities, and also runs die Microsoft Win32 
processing mode. U/WIN is compatible 
with most UNIX flavors. Pricing begins at 
$199.00 for Windows NT, $99-00 for Win¬ 
dows 95, 

Global Technologies Ltd. 

5 West Avenue 
Old Bridge, NJ 08857 
609-722-0906 
http://www .gtllnc.com/ 

Ahpab has announced its SourceAgain 
Professional java decompiler. SourceAgain 
Pro uses flow analysis techniques to 
quickly and accurately translate Java class 
file bytecode back into Java source code 
equivalent to the original code. It also can 
augment the original class file with de¬ 
bugging information so that developers 
can step into the source code without 
ever needing to recompile. SourceAgain 
Pro runs on PCs or UNIX workstations, 
and works with existing IDEs. Pricing 
starts at $99.00. 

Ah pah Software 

2250 Latham Street, #21 

Mountain View, CA 94040 

650^960-2472 

htt p:// www .ah pah. com/ 
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LOOX Software's LOOX 33 and LOOX++ 
3 3 are updated versions of its flagship dy¬ 
namic grapiiics and data visualization tools 
for UNIX and X/Motif developers. New 
features include support for animated and 
transparent GIF images, a scripting lan¬ 
guage for configuring complex behavioral 
relationships between objects, improved 
integration with existing Xlib code, addi¬ 
tional predefined dynamic objects, en¬ 
hanced annotation and network repre¬ 
sentation capabilities, advanced 2D charting 
capabilities, and extended support for hard 
copy output. LOOX 33 and LOOX++ 33 
are available on HP-UX and Sun Solaris. 
A single-user license is priced at $8950,00 
with no royalties or run-time fees. 

LOOX Software Inc. 

4962 El Camino Heal, Suite 206 

Los Altos, CA 94022 

650-903-0942 

http: // www, loox, com/ 

TechExeels PowerTrack 2.0 is a web- 
enabled defect tracker for software pro¬ 
ject management, PowerTrack tracks and 
manages all defects, dncumentation/fea¬ 
ture requests, and development issues. 
New features include integration with ver¬ 
sion control software (Microsoft Visual 
SourceSafe), time searching, time track¬ 
ing, enhanced importing and exporting, 
and the ability to create and use your own 
custom Crystal Reports within PowerTrack 
projects. Standard features include a scal¬ 
able client/server architecture, universal 
ODBC support, and presentation-quality 
reports and graphics. PowerTrack 2.0 is 
priced at $389,00 per license, A Fully func¬ 
tional evaluation copy is available at 
htt p: //www, pc jwertrack.com/. 

TechExcel Corp. 

3400 Ml. Diablo Boulevard, Suite 200 
Lafayette, CA 94549 
510-283-8930 

http://www. tech excel .com/ 

NeoMedia Technologies has announced 
ADAPT/2000 Release 3.0 of its cross¬ 
platform Cobol Year 2000 tool set. 
ADAPT/2000 Release 3-0 provides auto¬ 
mated source code conversion to imple¬ 
ment IBM's recently-announced MLE (mil¬ 
lennium language extensions) for Cobol, 
which provides a new r syntax to auto¬ 
matically support windowing, ADAPT/ 
2000 Release 3-0 allows implementation 
of a combination of MLE, expansion, and 
windowing of non-MLE date formats, and 
provides automatic inline bridging be¬ 
tween date formats and generation of data 
conversion programs where expansion is 
used. Other new additions include re¬ 
ducing the number of code passes to cat¬ 
alog duplicate Helds and create synonyms, 
greater programmer control over the au¬ 
tomated source code changes in the re¬ 


mediation process, additional field-level 
overrides such as custom window pivot 
dates or conversion formats, more inte¬ 
grated support for embedded SQL, and 
numerous usability features, ADAPT/2000 
starts at $20,000 for Windows NT, and 
$30,000 for UNIX. 

NeoMedia Technologies 
2201 Second Street, Suite 600 
Fort Myers, FL 33901 
941-337-3434 

http; //www.neomedia- tech.com/ 

Dundas Software's Ultimate Tool Box is a 
collection of over 200 MFC Extension Class¬ 
es covering areas such as GUI, file man¬ 
agement, communications, graphics, MAPI, 
and utilities. Ultimate ToolBox retails for 
$499,00, A yearly subscription costs an ad¬ 
ditional $249.00, and includes continuous 
additions of classes as they are developed, 
Dundas Software 

4800 Dundas Street West, Suite 500 

Etobicoke, ON 

Canada M9A 1B1 

416-239-7472 

http://www.dundas.com/ 

Ikonodyne lias announced its Morphous 
Object Technology (MOT), a component- 
based approach for developing CORE A 
applications. MOT provides an IDE to cre¬ 
ate, deploy, and manage CORBA objects 
written in Java. MOT uses wizards to al¬ 
low users to automatically build CORBA 
objets. CORBA objects can lie converted 
into Java Beans. MOT provides an admin¬ 
istrative and monitoring tool with its ob¬ 
ject migration capabilities. MOT supports 
both customized and commercial ORBs, 
including Visigenic s Visibroker 3.0, Pric¬ 
ing begins at $1450,00 per seat, and in¬ 
cludes one half day on-site training. 
Ikonodyne 

2680 Bayshore Parkway, Suite 106 
Mountain View, CA 94043 
650-938-3570 

htt p://www. i kon i xly ne, com/ 

SuperCede for java 2.0 is a development 
environment which offers complete sup¬ 
port for JDK 1,1 and JavaBeans. SuperCede 
2.0 supports compilation from Java source 
and bytecode as well as the dynamic load¬ 
ing of bytecode into running executables. 
SuperCede 2,0 Standard Edition costs 
$99*00, and includes a form-centric envi¬ 
ronment, database support, and over 60 
JavaBeans. The Professional Edition adds 
advanced database development capabil¬ 
ities, a fully integrated C++ compiler, and 
Acti veX/JavaBean i ntert>perability. 
SuperCede Inc, 

1 KM 10th Avenue NE 
Bellevue, WA 98004 
425-462-7242 

I ittpr//www.su percede. com/ 


ERTFS 1.0 from EBS is a file system for 
embedded applications that supports Win¬ 
dows 95-compatible long filenames, Win¬ 
dows 95-compatible FAT32 partitions, and 
enhanced support for PC Card ATA de¬ 
vices, IDE drives, LS-120 floppy disks, stan¬ 
dard floppy disks, and RAM/ROM disks. 
An API is provided for file and directory 
management, and contiguous files are sup¬ 
ported. ERTFS is provided in source form 
and may he used royalty free. The price 
of the complete package is $3825.00. 

EB5 Inc 
Box 873 

Groton, MA 01450 

978-448-9340 

http://www. etebin. com/ 

Geodesic Systems, LLC, has announced 
Version 3,1 of its Great Circle C/C++ de¬ 
bugger and memory manager for Win¬ 
dows 95/NT, which uses a new HTML- 
based GUI. The new interface makes the 
debugging process more visual and effi¬ 
cient. As with previous versions, Great Cir¬ 
cle 3-1 continues to use garbage collec¬ 
tion to debug and automatically manage 
an application's memory'. Great Circle 3-1 
supports Microsoft Visual C++ 4.x and 5.0, 
and lists for $695.00 per user license. 
Geodesic Systems 
414 North Orleans Street, Suite 410 
Cliicago, 1L 606l0 
312 - 832-1221 

http: //www geodesic. com/ 

JETS is a conformance validation suite for 
implementations of Java from Perennial. 
JETS is based on the original Java Lan¬ 
guage Specification and the Version LI 
modifications. In addition to testing con¬ 
formante, JETS enables implementation 
developers to generate bytecode from the 
test cases that can be moved to other plat¬ 
forms and tested for portability and func¬ 
tionality. 

Perennial Inc, 

4699 Old Ironsides Drive, Suite 210 
Santa Clara, CA 95054 
408-748-2900 

SojtQuad Inc, introduced HoTMetaL Ap¬ 
plication Server, a server-side web appli¬ 
cation platform that uses an XML-based 
syntax for developing applications. HoT- 
MetaL Application Server costs $495 per 
server with volume discounts, 

SoftQuad Inc. 

20 Egl inton Avenue West, 12th Floor 

P.O. Box 2025 

Toronto, ON 

Canada M4R 1K8 

416-544-9000 

mai 1 ©softquad. com 

http;// www, softqu ad. com/ 
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How to rule the game market: 
close your office for a week. 


Five days at the CGDC will make your team a lean, mean 

game-creating machine. 


You've got to be clued in to compete—and no one knows it better than game develop¬ 
ment professionals. The Computer Game Developers' Conference gives your team the skills 
and strategies it takes to produce the hottest games on the market. Next-generation 
programming. Inspirational management. Stunning graphics and stirring audio. Brilliant 
marketing. Over 200 intense classes taught by the top minds in the business, covering 
everything from online gaming to 3D to innovative distribution tactics. Go ahead—close 
the office for a week. You'll come back smarter and more productive than ever before— 
and that lasts all year long. 

(Besides, think whet you'll save on overhead!) 


Where Games .Of <0 \e, register Today lor the CGDC 

are Born may 4 0,1998 

CO M PUT= R Long Beach CA, Convention Center 

Si, ME Call 998-234-9479 or 419-909-2818 

DEVELOPERS Go To WWW.cgdC.COID 

CONFERENCE E-mail cgdC@Dlfi.COID 





Next Generation 

Software Protection 

. www.wibu.com 


The ONLY Serial Communication Libraries, DLLs, & Tools 
You Will Ever Need For Windows 3.x, Windows 95, 
Windows NT, and MS-DOS, Accept No Less Than The Best 


mpJfwwtf.wcscnetwmlftoroeMtm 


f'MiNiuiinjI Srrlnl I'O Unrln A PU i for 
Wlrtidm 3-iH Wliduw* 9S» NT, A MSI* IS 

411 link 1* 111 1 ' 1 1 1 1 IMk- 

ulf A* WlH*v, s API 


WIBU-SYSTEMS introduces 
the future of copy protection. 

The new WIBU-BOX/U combines 
the unmatched effectiveness of 
hardware based copy protection with 
the 100% trouble free operation of the 
Universal Serial Bus (USB). Call now for your free Test Kit 


Freform Supported ByAII Product* 


**?** mn 


«**t-nfcV«D. 

iron CudlllHD/B blnj. 

iron Cudtli»ll/1* bln) . 

iron CunUKSaD/li blL.). 

Iron ha) 

rTKtnilml EnAvlUbiVWI 
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.^fGRIFFIN 

V rtCHNOLQCl 1 IS 


_ r wese 

4425 Xing*™! tJr -Wu I, linvmvnL tX mw 
^ Itmnnd 66-4831 (!«U) 49 M&n 

(Pul {3IJSHJ3H (tilts MlitljWK-MtH 
Inin urt: na1cslujwc3cnEt.com 

i.iKs.'^n>fMi’.iM"ii \\\\ St hiip:.vww4VWL3,wl.CL)m:limTicli[in| 


NoH h 4nd &fliuh Aimtkit 
Griffin TeChnrFliigies,, LLC 
IfifTSt An*?** &m«. tjwrencif, KS 44D47 
T*l' (7fiS) BJ1MAF Fu ■ £78SJ BJ1-87B7 
Em4it u4ct@ptlEbLli4jeim ^wo^rtftinJi tom 


j*vjvwww witm.Kwn Quality chc- World Trusts 


COM M U NIDATION TOOLS FOR 

Windows NT/95 • DOS • OS/2 • QNX 

Use our fast deployment tools with IBM ARTIC 
realtime comm cards for: 

Bisync, async, frame relay, 

X.25, LAPS, HDLC/SDLC, 

S custom protocols . 

Multiple ports, multiple protocols on one w iJiTT iM 
card, CPU independence, & portable card J 

development in C. Visit our web site, see m Armc ; 


Hill 


tework Class Library^ 

Create high performance VxD and NT driver modules from common 
source with the Telradyne DFC library, The DFC driver wizard gets you 
started quickly and th e DFC classes simplify common driver tasks. 

Free DFC training courses (including free evaluation copy) held in San 
Jose, CA. Drop by www.tetradyne.com for more information. 

. —. , 2542 S. Bascom Ave, Suite 205 

y\ Campbell. CA 95008 

\ {4QH) 377-0367 FAX: (4D0) 377-6253 

ft w*fe sa3ss@tolradyiw.CMin 

www.telTBdyrw.coffl 


for yourself & talk to us soon. 
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We Can Help! 

Attaining Microsoft Certification is no easy task. But the professional and 
financial rewards are worth it, Transcender exam simulations give you a 
realistic preview of the nature and difficulty of Microsoft certification 
exams , Our exams show you where you need to focus your studies and 
give you detailed explanations of every answer to every question, 


Money Bock If You Don't Pass Guarantee** 


NEW! developer pak $499 

(Includes Items 1,2, & choose TWO from 3-7 listed below) 

1. WinArch-l-Cerf 1 ' 2.0 -*I43 

2. WinArch-1 E-Cert™ 2.0 fwv\dcf,n/\rm ,■ sus 

3. VB-Cert™ 5.0 *149 

4. VB-Cert™ 4.0 sn$ 

5. AttessCert™ 1.0 ■ Sizs 

6. AccessCert " 7,0 

7. SQL’Cerl™ 6.5 - S 145 

To order , cat! 615-726-8779, FAX 675-726-8884 

VISA/MC/ftMEX/DISC/MO/COD Add $4 $&h ($2S outside the US) 

ONLINE ORDERING! 

'Set W#l> iJfe fat CaftdMoH, 


MCSE exams: 

NT ffU 1 NT 

i >t i Winn, VvT Win r 

SMS txdvtfKftJ HS, K. R4fc SOI 
6i S itmfil# Admttijrfih 'f ■£* t 


Transcender" 

QH^pOTStJor. 

Sag Uxtoh Avp. 

NarfuriHn 1^37^03 
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Where do 
you find 116,000 
professional 
software developers... 

monthly? 

in 



DDJ subscribers are among the 
most technically advanced pro¬ 
grammers in the field. 


Hendela system consultants, Inc. 


Sizes your V2K 
effort across all 
applications. 

Analyzes your 
Nomad, Foxpro, 
COBOL, RPG, 
C/C++, and 
other text-based 
source code. 

Gets you 
started fast with 
1200+ date 
related text 
strings. 

Remediates 
code with a 
built-in editor 
that positions 
you right to the 
line to be fixed. 



To download a demo version or 
for additional product information, 
visit our web site: 

www.scany2k.com 

Cail us tolhfree: 

1 - 888 -SCANY 2 K 

1 - 888-722-6925 
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the secret to success 

• is no longer a ^secret 

"QJTrtisr Red Hat Lrnux Boxed Set: $49,95 
Red Hal Unux 

"PoLLrer Tools" E. archives: $99.95 
Rppllxuuare Office Suite far Linux: $99.95 

www-redhat-corr p e d Hers Mom for Linux: $149 

| ,., t , ,,\ L „: ; : ? } ^ I Rsd Hats TriTeal CUE for Linux: $99,95 
system Banks 

Or, Linux; $39.95 

4 ’ Linux Command Reference: $49:95 

k Maximum RPM: $34.95 




^ 11 a-' 

l-SBfl-redhatl !•113■5M7-DOIB 
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I TE Edit Control (Advanced RTF control) 
I HTMLViewer/Editor Add-on for TE 

► ReportEase Plus (report writer engine) 

► SpeUTEme DLL and dictionary 

I Form Plus (form designer/filler) 

I Rich Text Grid control and ChartPro 


INCLUDE COMPLETE 
SOURCE C 


ARE ROYALTY FREE 
AND AVAILABLE FOR 


— 


S ub s ystems 



SUB SYSTEMS, INC 

II Tiger Row, Georgetown, M 01833 
978-3 S2-9020 fox: 978-352-9019 
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& Design with C++ 

Need to build an objeti-oriented solution? 
Interested in Design Patterns? If so, BGGAD 
explains how to design solutions for robust, 
maintainable and logically thought-out code. 
Auihurjcssc liberty 1^61001-35^, $3495 to bookstores 
CttflfcteKcast Visit wvww m fc com fora full TOC and 
sample dim. 1-SUHJSEAVROX 


DBEBEl 
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html++ 

Product 

CGI Class Library 

Compatible with all internet web servers 
Generate interactive web pages in C+ + 
Ideal for webifying databases 
No more Perl or scripting 
Automates CGI, cookies, forms, state 
Win32, Win 16, OS/2, DOS, Uni*, Mac 

FREE DEMO Call 1-800-775-1073 


I DC Micro 
! Development 


Tel (606) 245-4175 
Fa* (606) 245-9305 
www, demi cro.com 


Network Software 
Development Toolkits 


Win32 NDtS Framework (WinDis3Z} - Supports 
development of Win32 applications that directly 
access ticiwufk adapters. 

TDI Client Samples Use TCP/IP hum device drivers. 

Printing Cnmmunieatiomi Auoc,, Inc. 

4201 Brunswick Court 
Smyrna, CIA 30080 
(770) 432-4580 


h tt p//w ww PC AU SA. com 


Generate Documentation 
from your source code with Doc Jet! 

I'nidiite HTML, 

MS Ilf Ip. and 

MSVYdaJ 

dru'iimunlalion from 
rniumcnlA: ill wmr 
cade - kind tan nun'I J 
nerd -rhanR^ yimr J 
rummi'iiltn^ style' 


Vi hi can finu- 
iuitc ywrr uatpBt 
with DucJei"! 
WYSIWYG 
duipul vtlilor- 


FREE TRIAL VERSION! 

h 1 1 p://www,U 11-tree .com 
I n fp(g; lid I -1 ret .cun i 5! 2-4 53-4 909 


The best cross -platform 
compression libraries 
for Win32, Win 16, 
DOS, OS/2, Unix, 
Macintosh, and 
embedded systems. 

Robust 45-lunction API compresses 
buffers, files, archives, disk spanning, 
encryption, seff-exfr EXE's and more. 

FREE DEMO Call 1-800-775-1073 

Id (606) 245-4175 
Fmt (606) 245-9305 
www. d cm icro .com 



[ DC Micro 
! Development 


I Boost your Basic!,. 

I_. f L If] JT.C 


'".tf 

KB 

■Boost pur programs into orbitf VB, QB, 

|Pft, C a Xbase libraries. Free demo disk 
1 4 booWel ■ call nowi tnfo@lerat 0 ch.com 
* ProMathiYB ■ numeric ProBas - DOS libs 
*FinLiWVB-inarida) 1 SpdlCliBcWVB 

^toiLibiFro - adscfiaiv' BuSafa Ms. 



Custom Programming: VB, C, Access, 
telephcny math, wet CF. ASP, Java. 


800 - 447-9120 x1336 

Depl 133$, IDO Pa* Avenue. Sole 360. Rockville MD 28850 
424-3903 Pa:301-782-8185 B8S301-762-6184 

www.teratech.com/ddj/ 




1-888-646-1933 
Bumble Bee Software 

P,0. Bo* 2007 
West ford, MA 01S&6 
info@bbeesoft.com 


► New Version 2.1. 

► Generates documen¬ 
tation, directly from the 

source code. 

► Extracts comments. | 
► User customized 
reports formats. 

► HTML WfnHeip, 
RTF, 

► FREE working 
evaluation at 
www.bbees oft .com 
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( EDUCATION 

add res s For Success 

J http://www.aics.edu ^ 

■ Earn B.S. and M.S. m Computer Science 

■ DISTANCE EDUCATION 

1 Object oriented B.S. program AMERICAN 

1 New courses in Java. Networking, HTML, MIS T"^ 11 1 Ih 
» Approved by more than 275 companies COMPUTER 

* FollowsACJA1EEEguidelines SCIENCES 

* Thousands o! students throughout U.S. 


5 c 
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PRODUCTS WANTED 


Free catalogue 1SQ0-767-AICS 
or http://www.aics. edu. 


IQ* 

IMm 


WANTED 

SOFTWARE PROGRAMS 

for partners in profit! 

You provide the completed Small Business. SOHO, Financial 
or Personal Productivity software program, F-Z L&gai 
Software will provide packaging, duplication, marketing and 
sales personnel to generate royalty checks tor you. 
w>th distribution in more than 6500 retail outlets, we have 
the experience to launch, market and sell your program at reiaif 
at no cost to you! um i-aOD-ff22-45fi& X131 

S&4 S. Military Trail, Deer Paid Beach. FL 33442 \j r " j t T 

Phone {454) 480-6433 * * Fax {954) 480-8906 

hi tp://www.e-/legal.cum — IZacktfiWlagal.cam 


-- 


SECURITY 
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BugCollector Pro 2.0 

Puts Bugs In Their Place 


Bug and Feature Request Tracking Database 

T racks 

bugs tram Initial report through final resolution 

Manages 

features from first request through impiementatkHi 

Organizes 

everything you need to create a top-quality product 

Sorts - Fitters ■ Reports * Graphs * Exports 

your software development data the way you wen! it 

Nesbitt Software Corporation 

Dfflttiload a FREE 30-day BugCollector trial from http Ji'www.ncstwtLcorrV 



FORTRAN 


The SPINDRIFT Library Version 3 


The Complete FORTRAN Programmer's Toolkit 

I Creole sopbhficoled slate-ol-llie-arl user interlaces lor your FORTRAN 
| program. Spindrift's cdlabk jebrmrtinej and functions lot you 

• Hova Join entry screens, dialog boxes, scrolling list boxes 
pull-down meniM.. push button?, balp ponds, ok. All with roll 
manse support. 

* Mudi morel 791 subroutines and lirnctlons for keybord, 
screen, moute, DOS cnnhal, and hardware status. 

| Conrpekirsive demonsri qi on proyrcn : shaars whtu you ecu do end how to do it. | 
Now shipping Version i.O with ait new User Guide. 

PRICE: 16 Bn Comp Hers - 514? 32 Bit Compilers - 5239 

I Spindrift Laboratories, Ltd, 

fa TanylewoQd Path • Galena * |L 61 DBG * USA | 

V Phone: <&I5} 777-8240 ■ FAX: (BIS) 77741411 

I E-mail: dgabl6@galenalitik.com 


Create Prolog H 
Components 

Diagnose, advise, configure and plan 
with the AmziP LogicServer" tools & 

I libraries {DLLs) for C/Oh-, Java, VB, 

Delphi, Web Servers 3 more , Win NT 
95 3.x & Solaris. Use ODBC, Sockets, 
Unicode & new OOP extensions. 

FREE Evaluation Version? 

A 1117 J hie. Email info @amzi.com 
Call +1 -513-425-8050 Fax 425-8025 


Reliable Software Security 


Cast effective products for aJI 
levels of software protection 
| and authentication, 

• CRVTTO-BOX® bmDd dwngles, 
SmartCard, PC Cards, and 
extension cards 

• Software based metering systems 

A ActiveX, OCX & VCL components 

I mk fir ▼ f : "'m ?B5 bbst fax hs-k-e 

_ WNl IC -'4 1-800-627 9468 

SOFTWARE 5 ECU gfH HM Chambloe-TucteFHd., HI 




SciTech MGL 

Now FREE with Full Source Code 


SdTech MGL is a complete graphics library for Windows 95/3.1/NT 
& DOS that has been used to develop leading titles tike kVfoOuafce© 
and Hexen /IB, SciTech MGL can be ported to other OS's in as lew 
as TDQQ lines of code. Includes: QpenGL® API support sprite 
library; Game Framework: hardware triple buffering; support for 
stereo LCD shutter glasses; automatic detection a utifization of VGA, 
ModeX. VESA VBE, VBE/AF, WinG, CreateDEBSedmn &. DimctDmw; 
and more. Supports standard C/C++ compilers S. Borland Delphi. 

For mom Information call (636) B94-0400 
Download SciTech MGL with lull source code from our website at 




CGI scripting is a piece of 
cake with Winductor 2.1 <255* 


Easy and dear syntax * USB e-mail, FTP, HFTP, 

Native la Windows HTML-parsing and fib I/O 

, t . directly tinm the scripts 

caniedlLjV^S * Script and orchestral 

actual POSfed data from otha < w " l[) “ s P rD B ,ams 
browser * Easy-to-follow CGI tutorial 


CrypKey Software Licensing System 

"Software protection with NO hardware lock and NO disk key * 


Down took your free 
evaluation copy today 


Winductar 




Tifiih iv an tWiita jwir wh.wwH life: 

■ upset opl -rmi Drat level, Jl your ■..-HIr:■ 

■ loatt* ar iJamoynui by .'uni<tf hnv 

, cr ui^jradfl yW Clrtwmffi IfipUfitty phdjmi. ft* GF 

.New! tiraquc Rcniiy T'a-Tiy rediCum upon insi-jR atom I- tiinl pmod oi-Jy per cuibHnrf. 
•Wwl uiNque AAI-On ftMtlirt-ldd mpd* OpDcrtU, k^i, Kind- of tort lu Utistiri} Gcertrti. 
Nwt rr>pH«ty injun $ minidt«(>Adh i» i«irf a (sUa cMnoa-s. 

: I,|-|nJ■ Is rrvrplftry romr..mNr vvril■ MS-DO^ MS.Wludowh 3.x, Win il\ 'iV^tS-J, WinNJ. 
tod liitt'wjes |tetwO*h I«tiW. i->i ..I No*! + <1 MMlynern ttfMsJ DttArfa 



Image Processing Library 

Feet BMP, TIFF, PCX, GIF, TGA, PNG, JPEG. Adjust 
brightness, contrast, sharpen, create filters, resize, rotate, 
+more of single image, multiple images, or any image area, 
color reduction to optimum, specific, or std, palette; print; 
scan; crop, combine, compare, blend images. 

DOS $199, 16-bit DLL $299, 32-bif DLL $499 
Catenary Systems 
314-962-7833/fax: 314-962-8037 
www. eaten ary. com/v icto r 
ask for tree demo arc avail vi$a/mo/o.o.d. 


Rainbow has software, hardware and 
internet solutions to help developers 
• Increase Sales 
m Expand Distribution 
m Prevent Piracy 

800 - 705-5552 ^RAINBOW 

www.rambow.com 



* Hypertext source & Class Browser 

■ Detailed Interactive Cross Reference 

* Invocation, Include, Declaration Heirarchy Charts 

■ HTML Publishing of Code & Analysis 

* WYSIWYG printing. Clipboard & Bitmap Output 

Download your 

eval copy today* Scientific Toolworks* Inc. 
www sc t LOO I s com 115 Etna Road Suite 18 A 



- U tALL {$99| L-i; hpniiL.Hree of canerteallMl lunction niefarcl^, cross-relierenc^ 
riB/runcCiDJl index. 

* C-CMT ($69] L;-3.:l L :L" r i:;i-rl:vijpSilfE. camrient- blacks ilifntliOns'WftfitdrrtrS 
used) lor oanh function 

* C-METHlC ($59) caitjld’e* ualh camiwitny. counts i.nes wiih comnnenfs. -code. 

‘C stiteinHnls 

* C -LIST {S5tt| lists and artHfi-diegrains, o: relornals source Into jser-salcclcd 
sl^KJard loiniats. 

*C-KEF (SG9) GrealK cross- rererencs cl facal/glObaminefpirnmoler iaentifle^, 

* C--DQC |$1 M) PACKAGE AU 5 programs inlegralpd a5 DOS prOflraitL <10SOD 
lines C-BE10W5E Windows [ffnptiic-rrHfl viewer 

■ C DOC Professtanal [$299) DOS, Windows, OS-'i' l.000.«00r linen. 

* NEW VER 7.01 WEB HTML flEfOflTSf _ 

SOFTWARE BLACKSMITHS INC. email @ swbs.corn 
60H St Ives Way, Mississauga VoicaiFan (905) 858-4466 
□NT Canada L5N-4M1 ft tip:// www.^wbs.com 


























































SWAINE'S FLAMES 



2001: The Legacy Code 

F unny, the connections the mind makes. I was sitting at my keyboard, thinking that the Year 
2000 problem was shaping up into quite a little horror show, when I suddenly realized that my 
thoughts had drifted over to the cast of the TV show 11 News Radio.” My eyes strayed to my 
science fiction shelves as my fingers began to move on the keys... 

The Internet radio station's manager cleared his throat. When this failed to get the attention of die staff, 
he tapped on his coffee cup with a pen. That worked: The cup cracked, hot coffee flowed onto his 
lap, and die staff looked up. 

Thank you. Oww. I called this meeting liecause the station is in trouble. For die past two years, since 
mid-1999, our advertising accounting software has lieen printing erroneous bills." 

The announcer opened his eyes. “Kindly do not utter the words 'erroneous' and hill" in the same sentence” 
“Sorry, Bill And please take your feet off my desk." 

“No problem, Dave. It was getting soggy anyway/ 

“Please pay attention; This is a pretty serious situation. It's this darned Year 2000 problem, and it's 
impacting the bottom line/ 

“J love it when you talk like that/ said the sultry brunette silting on the arm of his chair. 

“The problem is legacy code," he went on. 

Legacy code! The very name of eidrich horror! The entire staff shivered as though they had all spilled 
iced tea in dieir laps. The station manager turned to the Coboi mechanic who sat slouching insoudantly 
in a diair by the door, his legs stretched out in horn of him. “You're going to have to— er, do some¬ 
thing/ die station manager told him. 

The mechanic grinned, popping his gum and drawing a cigarette from the rolled-up sleeve of his 
t-shirt “I’m sorry, Dave/ he said “I cam do that/ 

“Somehow 1 knew you’d say that, hut occasionally you can get die machine to work by uttering the 
proper magical incantation/ 1 

The announcer broke in: “It's Moore s Law, Dave: ’Any sufficiently old technology is indistinguishable 
from magic/" 

“Well, we can't let this situation continue. I anticipated that we might have this problem. I hate to do 
this, but—Matthew, tell diem what you've been up to/' 

Matthew adjusted his glasses. U 1 have Forth been studying. 1 must the problem to solve die word of 
unbinding speak. We the Forth must use, Luke/ 

“It s Dave.” 

“I mean Dave. Oh, dam. Now Tve forgotten the word of unbinding." 

“I didn't think that would work, but there are other religions/ 

Over the next three weeks they tried various incantations. One Indian brought an automated prayer 
wheel that spelled out the nine billion names of got!. Didn’t work. And then the siation manager did 
whaL he had dreaded, and called in Monsignor Purvis and his Exorcist 2000. It got pretty ugly then. 
Lightning flashed in the station manager's office and lie spun a round in Ills chair three limes, spilling his 
coffee in his lap. The kserwriter in the outer office iicgan spewing gross-looking green- and white-striped 
paper, and you wouldn't believe the language dial came out of that thing. 

It was Java. 

Two weeks later, the siation manager strode out into the outer office and tapped the receptionist s cof¬ 
fee cup for attention. “Well, the exorcism has worked. And it really did everything: It resolved all the 
Year 2000 problems, ported die code to the new server, translated two million lines of Coboi into nine 
billion lines of 100 percenl pure Java, and morphed the whole schmear into a six-tiered intntnet/exmmel, 
replacing advertisers' snail mail addresses with e-mail addresses, not just in the database but even on 
their stationery and business cards. Now if we could just get it to reflect our new ad rates../ 

The kserprinter spit a piece of paper on the floor. The announcer picked it up and read it in his sten¬ 
torian tones. “It says, Tm sorry, Dave; 1 can't do that/’' 

“Well that's just great/ tile station manager said, Hopping down on the comer of the receptionist s desk 
and spilling her coffee in his lap. 

“We're back where we started/ 

“Let me give it a try/ the sultry brunette said, walking around behind the rack mounted accounting 
computer. After a moment, the machine l?egan to make moaning noises and suddenly paper liegan to 
fly out of the laser printer. 

“These rates are correct/ the sration manager said as the brunette came out from behind die machine 
blowing on her fingernails. “What did you do? n 
“i haven’t the slightest idea/ she answered enigmatically. 

“Which proves my point/ said the announcer. “It’s incomprehensible precisely because we need it. It's 
Murphy's Law, Dave: Any sufficiently crucial technology is indistinguishable from magic/" 
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